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The Run II jet physics group includes the Jet Algorithms, Jet Shape/Energy Flow, and Jet Measurements/Correlations 
subgroups. The main goal of the jet algorithm subgroup was to explore and define standard Run II jet finding procedures for 
CDF and D0. The focus of the jet shape/energy flow group was the study of jets as objects and the energy flows around these 
objects. The jet measurements/correlations subgroup discussed measurements at different beam energies; as measurements; 
and LO, NLO, NNLO, and threshold jet calculations. As a practical matter the algorithm and shape/energy flow groups 
merged to concentrate on the development of Run II jet algorithms that are both free of theoretical and experimental 
difficulties and able to reproduce Run I measurements. 

Starting from a review of the experience gained during Run I, the group considered a variety of cone algorithms and Kt 
algorithms. The current understanding of both types of algorithms, including calibration issues, are discussed in this report 
along with some preliminary experimental results. The jet algorithms group recommends that CDF and D0 employ the 
same version of both a cone algorithm and a Kt algorithm during Run II. Proposed versions of each type of algorithm are 
discussed. The group also recommends the use of full 4-vector kinematic variables whenever possible. The recommended 
algorithms attempt to minimize the impact of seeds in the case of the cone algorithm and preclustering in the case of the 
Kt algorithm. Issues regarding precluster definitions and merge/split criteria require further study. 
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1. Prologue 

The Run I jet programs at CDF and D0 made im- 
pressive measurements of the inclusive jet cross sec- 
tion, dijet angular and mass distributions, and triple 
differential cross sections. These measurements were 
all marked by statistical accuracy equal or superior to 
current theoretical accuracy [ [p. However, the always 
compelling search for quark compositeness, the quest 
to improve the calculational accuracy of QCD, and 
the desire to fully understand the composition of the 
proton will certainly prompt improvements over these 
measurements. Without question, with ~2 the 
Run II jet physics program will extend the jet mea- 
surements of Run I to even higher jet energies. 

There are three issues, experimental and theoreti- 
cal, that currently limit the sensitivity of composite- 
ness searches and QCD tests: limited knowledge of the 
parton distribution functions (pdfs) , systematic uncer- 



tainties related to jet energy calibration, and the lim- 
ited accuracy of fixed order perturbative calculations 
due to the incomplete nature of the calculations and 
incomplete specification of jet finding algorithms. In- 
adequate knowledge of the pdfs and calibration are cur- 
rently the dominant uncertainties, engendering greater 
than 50% uncertainties at the largest energies. The 
reader may refer to the chapter on Parton Distribu- 
tions for a complete discussion of pdf measurements. 

As mentioned, the uncertainty of NLO perturba- 
tive calculations is due in part to the inherent incom- 
pleteness of fixed order calculations. The initial meet- 
ing of the jet physics group included talks on "Lead- 
ing Order (LO) Multi-jet Calculations" by Michelan- 
gelo Mangano, "Next-to-Leading Order (NLO) Multi- 
jet Calculations" by Bill Kilgore, "Prospects for Next- 
to-NLO (NNLO) Multi-jet Calculations" by Lance 
Dixon, "Threshold Resummations for Jet Production" 
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by Nicolas Kidonakis, "Different Beam Energies" by 
Greg Snow, and "as Measurements in Jet Systems" by 
Christina Mesropian. These attempts to improve the 
accuracy of perturbative calculations show the vigor- 
ous nature of ongoing efforts and should prove fruitful 
before the arrival of Run II data. 

Jet algorithms, the other source of calculation uncer- 
tainty, start from a list of "particles" that we take to 
be calorimeter towers or hadrons at the experimental 
level, and partons in a perturbative QCD calculation. 
The role of the algorithm is to associate clusters of 
these particles into jets such that the kinematic prop- 
erties of the jets (e.g., momenta) can be related to the 
corresponding properties of the energetic partons pro- 
duced in the hard scattering process. Thus the jet 
algorithm allows us to "see" the partons (or at least 
their fingerprints) in the hadronic final state. 

Differences in the properties of reconstructed jets 
when going from the parton to the hadron or calorime- 
ter level are a major concern for a good jet algorithm. 
Each particle i carries a 4-momentum , which we 
take to be massless. The algorithm selects a set of par- 
ticles, which are typically emitted close to each other 
in angle, and combines their momenta to form the mo- 
mentum of a jet. The selection process is called the 
"jet algorithm" and the momentum addition rule is 
called the "recombination scheme". Note that these 
two steps are logically distinct. One can, for example, 
use one set of kinematic variables in the jet algorithm 
to determine the particles in a jet and then construct 
a separate set of kinematic variables to characterize 
the jets that have been identified. This point will be 
important in subsequent discussions. 

Historically cone algorithms have been the jet al- 
gorithm of choice for hadron-hadron experiments. As 
envisioned in the Snowmass algorithm [ p|, a cone jet of 
radius R consists of all of the particles whose trajecto- 
ries (assuming no bending by the magnetic field of the 
detector) lie in an area A = irR 2 of r\ x (f> space, where 
rj is the pseudorapidity r\ = — In tan 8/2. It is further 
required, as explained in detail below, that the axis of 
the cone coincides with the jet direction as defined by 
the -&r-weighted centroid of the particles within the 
cone (where Et is transverse energy, Et = Esin8). 
In principle, one simply searches for all such "stable" 
cones to define the jet content of a given event. 

In practice, in order to save computing time, the it- 
erative process of searching for the "stable" cones in 
experimental data starts with only those cones cen- 
tered about the most energetic particles in the event 
(the so-called "seeds"). Usually, the seeds are required 
to pass a threshold energy of a few hundred MeV in 
order to minimize computing time. The i/^-weighted 
centroids are calculated for the particles in each seed 



cone and then the centroids are used as centers for 
new cones in 77 x <p space. This procedure is iterated 
for each cone until the cone axis coincides with the cen- 
troid. Unfortunately, nothing prevents the final stable 
cones from overlapping. A single particle may belong 
to two or more cones. As a result, a procedure must be 
included in the cone algorithm to specify how to split 
or merge overlapping cones [ |3| . 

At least part of the uncertainty associated with fixed 
order perturbative calculations of jet cross sections can 
be attributed to the difficulties encountered when this 
experimental jet cone algorithm, with both seeds and 
merging/splitting rules, is applied to theoretical cal- 
culations. (See Ref. [ for a discussion of the CDF 
and D0 algorithms.) Neither issue was treated by the 
original Snowmass algorithm [ || that forms the ba- 
sis of fixed order perturbative cone jet calculations. 
Current NLO inclusive jet cross section calculations 
(which describe either two or three final state partons) 
require the addition of an ad hoc parameter Rsep [ @|- 
This additional parameter is used to regulate the clus- 
tering of partons and simulate the role of seeds and 
merging in the experimentally applied algorithm. In 
essence, the jet cone algorithm, used so pervasively at 
hadron-hadron colliders, must be modeled in NLO cal- 
culations. This modeling results in 2-5% uncertainties 
as a function of jet transverse energy Et in calculated 
cross sections. 

Even worse, with the current cone algorithms, cross 
sections calculated at NNLO exhibit a marked sensi- 
tivity to soft radiation. As an illustration, consider 
two well-separated partons that will just fit inside, but 
at opposite sides, of a single cone. With only the two 
partons, and nothing in between to serve as a seed, the 
current standard cone algorithms will reconstruct the 
two partons as two jets. At NNLO a very soft gluon 
could be radiated between the two well-separated par- 
tons and serve as a seed. In this case the single jet 
solution, with both partons inside, will be identified 
by the current cone algorithm. Thus the outcome of 
the current cone algorithm with seeds is manifestly sen- 
sitive to soft radiation. Because of the difficulties in- 
herent with typical usage of the cone algorithm, the 
jet algorithm and jet shape/energy flow subgroups de- 
cided to establish an Improved Legacy Cone Algorithm 
(whimsically dubbed ILCA). Ideally, the ILCA should 
replicate Run I cross sections within a few percent, but 
not have the same theoretical difficulties. 

Inspired by QCD, a second class of jet algorithms, 
Kt algorithms, has been developed. These algo- 
rithms successively merge pairs of "particles" in or- 
der of increasing relative transverse momentum. They 
typically contain a parameter, D (also called R), 
that controls termination of merging and character- 
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izes the approximate size of the resulting jets. Since a 
Kt algorithm fundamentally merges nearby particles, 
there is a close correspondence of jets reconstructed 
in a calorimeter to jets reconstructed from individual 
hadrons, leptons and photons. Furthermore, every par- 
ticle in an event is assigned to a unique jet. Most im- 
portantly, Kt jet algorithms are, by design, infrared 
and collinear safe to all orders of calculation. The al- 
gorithms can be applied in a straightforward way to 
fixed-order or resummed calculations in QCD, partons 
or particles from a Monte Carlo event generator, or en- 
ergy deposited in a detector [ || . 

However, until recently, a full program for the cal- 
ibration of Kt algorithms at hadron-hadron colliders 
had not been developed. This was due mostly to dif- 
ficulties with the subtraction of energy from specta- 
tor fragments and from the pile-up of multiple hadron- 
hadron interactions. Since the Kt jets have no fixed 
shape, prescriptions for dealing with the extra energy 
have been difficult to devise and the use of K T al- 
gorithms at hadron-hadron colliders has been limited. 
Also, as with the issue of seeds in the case of the cone 
algorithm, there is a practical question of minimizing 
the computing time required to apply the Kt algo- 
rithm. Typically this is treated in a preclustering step 
where the number of "particles" is significantly reduced 
before the Kt algorithm is applied. A successful Kt 
algorithm must ensure that any preclustering step does 
not introduce the sort of extra difficulty found with 
seeds. 

Buoyed by the successful use of K T algorithms at 
LEP and HERA, eager to benefit from their theoret- 
ical preciseness, and reassured by recent success with 
calibration, the jet physics group decided to specify a 
standard Kt algorithm for Run II. 

2. Attributes of the Ideal Algorithm 

Although it provided a good start, the Snowmass 
algorithm has proved to be incomplete. It does not 
address either the phenomena of merging and splitting 
or the role of the seed towers with the related soft gluon 
sensitivity. Also, jet energy and angle definitions have 
varied between experiments. To treat these issues, the 
group began discussions with the following four general 
criteria: 

1. Fully Specified: The jet selection process, the jet 
kinematic variables and the various corrections 
(e.g., the role of the underlying event) should 
be clearly and completely defined. If necessary, 
preclustering, merging, and splitting algorithms 
must be completely described. 



2. Theoretically Well Behaved: The algorithm 
should be infrared and collinear safe with no ad 
hoc clustering parameters. 

3. Detector Independence: There should be no de- 
pendence on cell type, numbers, or size. 

4. Order Independence: The algorithms should be- 
have equally at the parton, particle, and detector 
levels. 

The first two criteria should be satisfied by every 
algorithm; however, the last two can probably never 
be exactly true, but should be approximately correct. 

2.1. Theoretical Attributes of the Ideal 
Algorithm 

The initial efforts of the algorithm working group 
were focused on extending and illuminating the list of 
desirable features of an "ideal" jet algorithm. From 
the "theoretical standpoint" the following features are 
desirable and, for the most part, necessary: 

1. Infrared safety: The algorithm should not only 
be infrared safe, in the sense that any infrared 
singularities do not appear in the perturbative 
calculations, but should also find solutions that 
are insensitive to soft radiation in the event. As 
illustrated in Fig. [I], algorithms that look for jets 
only around towers that exhibit some minimum 
energy activity, called seed towers or just seeds, 
can be quite sensitive to soft radiation. The ex- 
perimental cone algorithms employed in previous 
runs have such seeds. 

2. Collinear safety: The algorithm should not only 
be collinear safe, in the sense that collinear sin- 
gularities do not appear in the perturbative cal- 
culations, but should also find jets that are in- 
sensitive to any collinear radiation in the event. 

A) Seed-based algorithms will in general break 
collinear safety until the jets are of sufficiently 
large Et that splitting of the seed energy be- 
tween towers does not affect jet finding (See 
Fig. ||). This was found to be the case for jets 
above 20 GeV in the D0 data, where jets were 
found with 100% efficiency using a seed tower 
threshold of 1.0 GeV [ |(|. The collinear depen- 
dence introduced via the seed threshold is re- 
moved when the jets have sufficient Et to be 
reconstructed with 100% efficiency. 

B) Another possible collinear problem can arise 
if the algorithm is sensitive to the Et ordering 
of particles. An example would be an algorithm 
where a) seeds are treated in order of decreasing 
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Et and b) a seed is removed from the seed list 
when it is within a jet found using a seed that 
is higher on the list. For such an algorithm con- 
sider the configuration illustrated in Fig. [| The 
difference between the two situations is that the 
central (hardest) parton splits into two almost 
collinear partons. The separation between the 
two most distant partons is more than R but less 
than 2R. Thus all of the partons can fall within 
a single cone of radius R around the central par- 
ton(s). However, if the partons are treated as 
seeds and analyzed with the candidate algorithm 
suggested above, different jets will be identified 
in the two situations. On the left, where the sin- 
gle central parton has the largest Et, a single 
jet containing all three partons will be found. In 
the situation on the right, the splitting of the 
central parton leaves the right-most parton with 
the largest E T . Hence this seed is looked at first 
and a jet may be found containing only the right- 
most and two central partons. The left-most par- 
ton is a jet by itself. In this case the jet number 
changes depending on the presence or absence of 
a collinear splitting. This signals an incomplete 
cancellation of the divergences in the real and vir- 
tual contributions to this configuration and ren- 
ders the algorithm collinear unsafe. While the al- 
gorithm described here is admittedly an extreme 
case, it is not so different from some schemes used 
in Run I. Clearly this problem should be avoided 
by making the selection or ordering of seeds and 
jet cones independent of the Et of individual par- 
ticles. 

3. Invariance under boosts: The algorithm should 
find the same solutions independent of boosts in 
the longitudinal direction. This is particularly 
important for pp collisions where the center-of- 
mass of the individual parton-parton collisions is 
typically boosted with respect to the pp center- 
of-mass. This point was emphasized in conversa- 
tions with the Jet Definition Group Les Houches [ 

m 

4. Boundary Stability: It is desirable that the kine- 
matic variables used to describe the jets exhibit 
kinematic boundaries that are insensitive to the 
details of the final state. For example, the scalar 
Et variable, explained in more detail in the next 

lr The Les Houches group discussed jet algorithms for both the 
Tevatron and LHC, and they sharpened their algorithm re- 
quirements by also requiring boundary stability (the kinematic 
boundary for the one jet inclusive jet cross section should be at 
the same place, Et = independent of the number of fi- 

nal state particles), suitability for soft gluon summations of the 
theory, and simplicity and elegance. 



section, has a boundary that is sensitive to the 
number of particles present and their relative an- 
gle (i.e., the boundary is sensitive to the mass of 
the jet). The bound E1^ ax — yfs/2 applies only 
for collinear particles and massless jets. In the 
case of massive jets the boundary for Et is larger 
than t/s/2- Boundary stability is essential in or- 
der to perform soft gluon summations. 

5. Order Independence: The algorithm should find 
the same jets at parton, particle, and detector 
level. This feature is clearly desirable from the 
standpoint of both theory and experiment. 

6. Straightforward Implementation: The algorithm 
should be straightforward to implement in per- 
turbative calculations. 




Figure 1. An illustration of infrared sensitivity in 
cone jet clustering. In this example, jet clustering be- 
gins around seed particles, shown here as arrows with 
length proportional to energy. We illustrate how the 
presence of soft radiation between two jets may cause a 
merging of the jets that would not occur in the absence 
of the soft radiation. 



2.2. Experimental Attributes of the Ideal 
Algorithm 

Once jets enter a detector, the effects of particle 
showering, detector response, noise, and energy from 
additional hard scatterings from the same beam cross- 
ing will subtly affect the performance of even the most 
ideal algorithm. It is the goal of the experimental 
groups to correct for such effects in each jet analysis. 
Ideally the algorithm employed should not cause the 
corrections to be excessively large. From an "experi- 
mental standpoint" we add the following criteria for a 
desirable jet algorithm: 

1. Detector independence: The performance of the 
algorithm should be as independent as possible 
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Efficient use of computer resources: The jet al- 
gorithm should provide jet identification with a 
minimum of computer time. However, changes 
in the algorithm intended to minimize the nec- 
essary computer resources, e.g., the use of seeds 
and preclustering, can lead to problems in the 
comparison with theory. In general, it is better 
to invest in more computer resources instead of 
distorting the definition of the algorithm. 



Figure 2. An illustration of collinear sensitivity in jet 
reconstruction. In this example, the configuration on 
the left fails to produce a seed because its energy is split 
among several detector towers. The configuration on 
the right produces a seed because its energy is more 
narrowly distributed. 




Figure 3. Another collinear problem. In this case we 
illustrate possible sensitivity to Et ordering of the par- 
ticles that act as seeds. 



of the detector that provides the data. For exam- 
ple, the algorithm should not be strongly depen- 
dent on detector segmentation, energy response, 
or resolution. 



5. Maximal reconstruction efficiency: The jet algo- 
rithm should efficiently identify all physically in- 
teresting jets {i.e., jets arising from the energetic 
partons described by perturbative QCD). 

6. Ease of calibration: The algorithm should not 
present obstacles to the reliable calibration of the 
final kinematic properties of the jet. 

7. Ease of use: The algorithm should be straight- 
forward to implement with typical experimental 
detectors and data. 

8. Fully specified: Finally, the algorithm must be 
fully specified. This includes specifications for 
clustering, energy and angle definition, and all 
details of jet splitting and merging. 

These experimental requirements are primarily a 
matter of optimization under real-life conditions and 
will, in general, exhibit complicated sensitivities to 
running conditions. They have a strong bearing on 
the ease with which quality physics measurements are 
achieved. Many of the details necessary to fully imple- 
ment the jet algorithms have neither been standard- 
ized nor widely discussed and this has sometimes led 
to misunderstandings and confusion. The remainder of 
this chapter describes the cone and Kt algorithms dis- 
cussed and recommended by the QCD at Run II Jets 
Group. 



2. Minimization of resolution smearing and angle 
biases: The algorithm should not amplify the in- 
evitable effects of resolution smearing and angle 
biases. 

3. Stability with luminosity: Jet finding should not 
be strongly affected by multiple hard scatterings 
at high beam luminosities. For example, jets 
should not grow to excessively large sizes due to 
additional interactions. Furthermore the jet an- 
gular and energy resolutions should not depend 
strongly on luminosity. 



3. Cone Jet Algorithms 

3.1. Introduction 

This section should serve as a guide for the defini- 
tion of common cone jet algorithms for t he T cvatron 
and possibly future experiments. Section |3.2 



reviews 



the features of previously employed cone algorithms. 
Section p^3| describes a seedless cone algorithm. Sec- 



tion 3.4 gives a description of seed-based cone algo- 
rithms and discusses the need for adding midpoints 
between seeds as alternate starting points for cluster- 
ing. Finally, in Section 3.5, we offer a detailed proposal 
for a common cone jet algorithm in Run II analyses. 
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3.2. Review of Cone Algorithms 

Cone algorithms form jets by associating together 
particles whose trajectories (i.e., towers whose cen- 
ters) lie within a circle of specific radius R in rj x <f> 
space. This 2-dimensional space is natural in pp colli- 
sions where the dynamics are spread out in the longitu- 
dinal direction. Starting with a trial geometric center 
(or axis) for a cone in r\ x <f> space, the energy- weighted 
centroid is calculated including contributions from all 
particles within the cone. This new point in r\ x cf> is 
then used as the center for a new trial cone. As this 
calculation is iterated the cone center "flows" until a 
"stable" solution is found, i.e., until the centroid of the 
energy depositions within the cone is aligned with the 
geometric axis of the cone. This leads us to our ini- 
tial cone algorithm based on the Snowmass scheme [ 
U of scalar ^-weighted centers. The particles are 
specified by massless 4- vectors (E l — |p l |,p l ) with 
angles [(f) 1 , 6 l , rf = — In (tan(0 l /2)j) given by the di- 
rection from the interaction point with unit vector 
P 1 = p l /E l . The scalar Et for each particle is 
Eip — E l sm(6 l ). For a specified geometric center 
for the cone (?? ,</> ) the particles i within the cone 
satisfy 



i C C* 



(77* - 77°) + 



b c f < R. 



(1) 



In the Snowmass algorithm a "stable" cone (and po- 
tential jet) satisfies the constraints 



V 



cc Et^ 1 



E$ 



E% 



(2) 



(i.e., the geometric center of the previous equation is 
identical to the -Ey-weighted centroid) with 



Ek 



iCC 



(3) 



Naively we can simply identify these stable cones, and 
the particles inside, as jets, J — C. (We will return to 
the practical issues of the impact of seeds and of cone 
overlap below.) 

To complete the jet finding process we require a re- 
combination scheme. Various choices for this recombi- 
nation step include: 

1. Original Snowmass scheme: Use the stable cone 
variables: 



E^ = E T — Ej. , 

iCJ=C 

^ = w E e tv\ 

T iC.J=C 

<t> J = 4 E e tP- 

T iCJ=C 



(4) 
(5) 
(0) 



2. Modified Run I recombination schemes: After 
identification of the jet as the contents of the sta- 
ble cone, construct more 4- vector-like variables: 



Et 



El = 



E 1 . 



E T ■ cos(^) 
E T ■ sin(^) 
E l ■ cos(0 l ) 

E E 



x,y,z ' 



iCJ=C 



, J( E t) 2 + W 

E^-) 



(7) 
(8) 
(9) 
(10) 

(11) 



A) In Run I, D0 used the scalar Ej, sum as de- 
fined in Eq. H but used the following definitions 
for ?y J and ^ 



-In I tan(y) 
E J 

tan-(^). 



(12) 
(13) 



B) In Run I, CDF used the angular definitions 
in Eqs. p^Rl and also replaced the Snowmass 



scheme Ef with: 



E^p 



E J 



E J 



iCJ 



(14) 



Note that in the Snowmass scheme the designation of 
the centroid quantities rj J and 4> J of Eqs. [| and |as a 
pseudorapidity and an azimuthal angle is purely con- 
vention. These quantities only approximate the true 
kinematic properties of the massive cluster that is the 
jet. They are, however, approximately equal to the 
"real" quantities, becoming exact in the limit of small 
jet mass (M J « Et). Further these quantities trans- 
form simply under longitudinal boosts (i.e., n J boosts 
additively while <fi J is invariant) guaranteeing that the 
jet structure determined with the Snowmass algorithm 
is boost invariant. It is also worthwhile noting that the 
Snowmass rf is a better estimator of the "true" jet 
rapidity (y J ) defined below than the "true" jet pseu- 
dorapidity defined in Eq. [T^. The latter quantity does 
not boost additively (for M J > 0) and is not a good 
variable for systematic studies. 

While the scalar sum Et is invariant under longi- 
tudinal boosts, it is not a true energy variable. This 
feature leads to difficulty in resummation calculations: 
the kinematic boundary of the jet Et shifts away from 
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y/s/2 appropriate for two parton kinematics when ad- 
ditional final state partons are included and the jet 
acquires a nonzero mass. On the other hand the Snow- 
mass variables have the attractive feature of simplicity, 
involving only arithmetic rather than transcendental 
relationships. An alternate choice, which we recom- 
mend here, is to use full 4- vector variables for the jets. 



3. E-Scheme, or ^-vector recombination: 



iCJ=C 



PT 



l E J + vi 

— In ■ 



- Pi 



/ = tan" 1 4 

pi 



(15) 

(16) 
(17) 



Note that in this scheme one does not use the scalar Et 
variable. The 4-vector variables defined above mani- 
festly display the desired Lorentz properties. Phase 
space boundaries will exhibit the required stability nec- 
essary for all-order resummations. While the structure 
of analytic fixed order perturbative calculations is sim- 
pler with the Snowmass variables, NLO cross section 
calculations are now also possible with Monte Carlo 
programs [ ||, ^, [l(], [ll]] . Such programs are fully flex- 
ible with respect to the choice of variables and the 4- 
vector variables pose no practical problems. It is also 
important to recall that, at least at low orders in per- 
turbation theory, it is not possible for energy to be 
conserved in detail in going from the parton level to 
the hadron level. At the parton level the jet will al- 
most surely be a cluster of partons with non-zero color 
charge. At the hadron level the cluster will be com- 
posed of color-singlet hadrons. The transition between 
the two levels necessarily involves the addition (or sub- 
traction) of at least one colored parton carrying some 
amount (presumably small) of energy. 

One can also employ these true 4-vector variables, 
rather than the -Er-weighted centroid, in the jet algo- 
rithm to find stable cones. While this choice will com- 
plicate the analysis, replacing simple arithmetic rela- 
tionships with transcendental relationships, the group 
recommends that this possibility be investigated. The 
goal is to have a uniform set of kinematic variables 
with appropriate Lorentz properties throughout the jet 
analysis. 

At this point it might seem that a simple and 
straightforward jet definition would arise from just the 
choice of a cone size and a recombination scheme. The 
algorithm would then be used to scan the detector and 
simply find all stable cones. In practice, this naive 
algorithm was found to be incomplete. To keep the 
time for data analysis within reasonable bounds the 



concept of the seed was introduced. Instead of look- 
ing "everywhere" for stable cones, the iteration process 
started only at the centers of seed towers that passed 
a minimum energy cut (how could a jet not have size- 
able energy deposited near its center?). Additionally, 
in Run I both CDF and D0 reduced the number of 
seed towers used as starting points by consolidating 
adjacent seed towers into single starting points. (The 
actual clustering was always performed on calorimeter 
towers.) These types of procedures, however, create 
the problems illustrated in Figs, [j], || and ||, introduc- 
ing sensitivity to soft emissions and the possibility of 
collinear sensitivity. 

The naive Snowmass algorithm also does not address 
the question of treating overlapping stable cones. It 
is quite common for two stable cones to share some 
subset (but not all) of their particles. While not all 
particles in the final state need to be assigned to a jet, 
particles should not be assigned to more than one jet. 
Hence there must be a step between the stable cone 
stage and the final jet stage where either the overlap- 
ping cones are merged (when there is a good deal of 
overlap) or the shared particles are split between the 
cones. Typically cones whose shared energy is larger 
than a fixed fraction (e.g., f — 50%) of the energy in 
the lower energy cone are merged. For the cases with 
shared energy below this cut, the shared particles are 
typically assigned to the cone that is closer in r\ x </> 
space. As suggested earlier, the detailed properties 
of the final jets will depend on the merge/split step 
and it is essential that these details be spelled out in 
the algorithm. We provide examples in the following 
sections. 

3.3. Cone Jets without Seeds 

Since many of the issues outlined in the previous 
section arise from the use of seed towers to define the 
starting point in the search for stable cones, it is worth- 
while to consider the possibility of a seedless cone algo- 
rithm. A seedless algorithm is infrared insensitive. It 
searches the entire detector and finds all stable cones 
(or proto-jetsf]), even if these cones do not have a seed 
tower at their center. Collinear sensitivity is also re- 
moved, because the structure of the energy depositions 
within the cone is unimportant. In this section we 
present a preliminary study of such an algorithm. 

3.3.1. Seedless Jet Clustering 

We give an example of a seedless algorithm in the 
flowchart in Fig. [|. The basic idea [ [l^] follows from 
the concept of "flowing" cone centers mentioned ear- 
lier. The location of a stable cone will act as an at- 



2 At the clustering stage we refer to stable cones as proto-jets. 
These may be promoted to jets after surviving the splitting and 
merging stage. 
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tractor towards which cones will flow during the iter- 
ation process. If the process starts close to such a 
stable center, the flow steps will be small. Starting 
points further from a stable center will exhibit larger 
flow steps towards the stable center during the itera- 
tion. Starting points outside of the region of attraction 
will again exhibit small flow steps. The method starts 
by looping through all detector towers^ in some appro- 
priate fiducial volume. For each tower k, with center 
k = (r] k ,(j) k ), we define a cone of size R centered on 
the tower 

c* = (i?°W,0 o W), 

i C C k : y] (77* - r) ck ) 2 + (ft - <P ck ) 2 < R. (18) 
For each cone we evaluate the .fir-weighted centroid 

C* = (v Ck J C ' k ) , (19) 



V 



iCC k 



- C h _ Y,icC k 



Eg" 



> (20) 
(21) 



Note that, in general, the centroid C k is not identical 

to the geometric center C k and the cone is not stable. 
While this first step is resource intensive, we simplify 
the subsequent analysis with the next step. If the cal- 
culated centroid of the cone lies outside of the initial 
tower, further processing of that cone is skipped and 
the cone is discarded. The specific exclusion distance 
used in this cut is a somewhat arbitrary parameter and 
could be adjusted to maximize jet finding efficiency 
and minimize the CPU demand of the algorithm. All 
cones that yield a centroid within the original tower 
become preproto-jets. For these cones the process of 
calculating a new centroid about the previous centroid 
is iterated and the cones are allowed to "flow" away 
from the original towers. This iteration continues un- 
til either a stable cone center is found or the centroid 
migrates out of the fiducial volume. The surviving sta- 
ble cones constitute the list of proto-jets. Note that 
the tower content of a cone will vary as its center moves 
within the area of a single tower. For a cone of radius 
R and tower dimension A (in either rj or ft) the mini- 
mum change in the cone center location for which the 
tower content in the cone changes by at least one tower 
is characterized by A 2 /2R. This distance is of order 
0.007 for A = 0.1 and R = 0.7 (i.e., 10% of a tower 
width if the diameter of the cone, 2i?, is ten times a 
tower width). 

3 While the algorithm may be run on individual detector cells, we 
do not believe that cell-level clustering is within the CPU means 
of current experiments for the largest expected data samples. 




Send proto-jets~\ 
to split/merge J 
algorithm J 



Calculate cone centroid (C) 
around next tower in list 
using radius R 




Remove 
this cone 



Add cone to 
list of proto-jets 



Figure 4. A seedless clustering algorithm. 
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An even more streamlined option would be to keep 
only those cones that yield a stable cone center without 
leaving the original tower. Since a trial cone is origi- 
nally placed at the center of every tower, the only dis- 
tinct stable cone centers missed by this (much!) faster 
algorithm correspond to very limited regions of attrac- 
tion (less than the area of a tower) . Such situations can 
arise in only two cases. One possibility is that there are 
two (or more) stable directions within a single tower. 
The second possibility is that there is a stable direc- 
tion within a tower but it is not found starting at the 
tower center. While both of these scenarios arise in 
analyses of realistic data, they do not constitute cause 
for concern. Proto-jets with directions that are nearly 
collinear (i.e., that lie within a single tower) will have 
nearly the same tower content and be merged with lit- 
tle impact on the final jet properties. Isolated stable 
directions with very small regions of attraction (the 
second case) are most likely fluctuations in the back- 
ground energy level and not the fingerprints of real 
emitted partons. In any case the stable cone centers 
not found by the streamlined algorithm invariably cor- 
respond to low Et proto-jets and are well isolated from 
large Et proto-jet directions (otherwise they would be 
attracted into the larger Et jet). Thus the leading Et 
jets (after merging and splitting) found by either the 
original seedless algorithm or the streamlined version 
are nearly identical. 

For practical use it may also be necessary to apply 
some minimum Et threshold to the list of proto-jets. 
Ideally such a threshold would be set near the noise 
level of the detector. However, a higher setting might 
be warranted to reduce the sensitivity of the algorithm 
to energy depositions by multip le interactions at high 
luminosities (see Section |3.3.4 for details of seedless 
clustering at the detector level). 

In general, a number of overlapping cones, where 
towers are shared by more than one cone, will be found 
after applying the stable cone finding procedure. As 
noted earlier, the treatment of proto-jets with overlap- 
ping regions can have significant impact on the behav- 
ior of the algorithm. 

3.3.2. Splitting and Merging Specifications 

A well-defined algorithm must include a detailed pre- 
scription for the splitting and merging of proto-jets 
with overlapping cones. We provide an outline of a 
splitting and merging algorithm in Fig. |^. It is im- 
portant to note that the splitting and merging process 
does not begin until all stable cones have been found. 
Further, the suggested algorithm always works with 
the highest Et proto-jet remaining on the list and the 
ordering of the list is checked after each instance of 
merging or splitting. If these conditions are not met, 



it is difficult to predict the behavior of the algorithm 
for multiply split and/or merged jets and similar lists 
of proto-jets can lead to distinctly different lists of jets. 
This undesirable situation does not arise with the well- 
ordered algorithm in Fig. [|. While there will always 
be some order dependence in a splitting and merg- 
ing scheme when treating multiply overlapping jets, 
we recommend fixing this order by starting with the 
highest Et proto-jet and working down in the Et or- 
dered list. In this way the action of the algorithm 
is to prefer cones of maximal Et- Note that, after 
a merging or splitting event, the Et ordering on the 
list of remaining proto-jets can change, since the sur- 
vivor of merged jets may move up while split jets may 
move down. Once a proto-jet shares no towers with 
any of the other proto-jets, it becomes a jet and is not 
impacted by the subsequent merging and splitting of 
the remaining proto-jets. As noted earlier and illus- 
trated in Fig. H, the decision to split or merge a pair 
of overlapping proto-jets is based on the percentage of 
transverse energy shared by the lower Et proto-jet. 
Proto-jets sharing a fraction greater than / (typically 
/ = 50%) will be merged; others will be split with 
the shared towers individually assigned to the proto- 
jet that is closest in r\ X <f> space. This method will 
perform predictably even in the case of multiply split 
and merged jets. Note that there is no requirement 
that the centroid of the split or merged proto-jet still 
coincides precisely with its geometric center. 

3.3.3. Parton Recombination 

The definition of calorimeter towers, i.e., a dis- 
cretization of (?7, 4>) space, would be cumbersome in 
a theoretical calculation, and is indeed not necessary. 
In a theoretical calculation at fixed order, the maximal 
number of partons, n, is fixed. With specified parton 
momenta, the only possible positions of stable cones 
are then given by the partitions of the n parton mo- 
menta, i.e., there are at most 2™ — 1 possible locations 
of proto-jets. They are given by the positions of indi- 
vidual partons, all pairs of partons, all combinations of 
three partons, etc. In a perturbative calculation, e.g. 
via a NLO Monte Carlo program, the proto-jet selec- 
tion of the seedless algorithm can then be defined as 
follows: 

1. Make a list of centroids for all possible parton 
multiplets. These are derived from the coordi- 
nates of all parton momenta pi, of all pairs of 
parton momenta pi + Pj , of all triplets of par- 
ton momenta pi +pj +Pk, etc. For each centroid 
record which set of partons defines it. 

2. Select the next centroid on the list as the center 
of a trial cone of radius R. 
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Go to the split/merge stage if the list of cone 
centers is exhausted. 




split proto-jcts 




Assign shared cells 


N 


to nearest 


proto-jets 




Recalculate proto-jets 




Goto Start 





merge proto-jcts 

Add neighbor's cells 

to this proto-jet 
and drop neighbor 

Recalculate this 
proto-jet 

Goto Start 



Figure 5. A fully specified splitting and merging algo- 
rithm. 



3. Check which partons are inside the trial cone. 

4. If the parton list of the centroid and that of the 
trial cone disagree, discard the trial cone and go 
to (2). If the lists agree, add the set of partons 
inside the trial cone as a new entry to the list of 
proto-jets. 

As before, different proto-jets may share partons, i.e. 
they may overlap. The required split/merge step is 
then identical to the calorimeter- level steps (Fig. |J), 
with towers replaced by partons as elements of proto- 
jets. 

In the case of analytic evaluations of the NLO per- 
turbative jet cross section [ |l3) the integrations over 
the multi-parton phase space are divided into various 
disjoint contributions. For a jet of fixed Ej,, rj J and <p J 
we have only the cases where a) one parton is in the jet 
direction with the jet Et, and the other partons are 
excluded from nearby directions where they could fit in 
a jet cone with the first parton, or b) two partons fit in 
a single cone with their centroid properties constrained 
to be the jet values. The questions of overlap, splitting 
and merging never arise at this order for R < 7r/3. 

3.3.4. Tests of a Seedless Algorithm 

In this section we offer some insight into the per- 
formance of the seedless cone algorithm applied to a 
detector. We begin by examining a simulated large- 
Et jet event in the D0 detector (Fig. ||). The event 
was chosen from a sample generated with pythia [ |l4| 
using a 160 GeV minimum Et cut at the parton-level 
generator. After hadronization, the events were pro- 
cessed through a full simulation of the D0 detector. 
The towers in the central region (—3.2 < r\ < 3.2) 
are 0.1 x 0.1 in size. Fig. ^ shows the distribution of 
calorimeter tower Et's for the event in the central fidu- 
cial volume (—2.4 < rj < 2.4) where cones of R = 0.7 
can be fully contained in the central region. Three jets 
clearly dominate the display (along with a less distinc- 
tive feature at the large t] boundary near cf> = 4). Fig. [?| 
shows the Et contained in a cone of radius 0.7 centered 
at each calorimeter tower, displaying the same struc- 
ture for the event in a slightly different language. We 
can make this picture even more clear by appealing 
to the "flow imagery" of Section 



3.3.1 



We define a 

flow vector as the 2-dimensional vector difference be- 
tween the calculated centroid for a cone centered on a 
tower and the geometric center of the tower (C — C k 
in Eqs. |lj and |l^). This vector vanishes for a stable 
cone. This flow vector is plotted in the corresponding 
range of r) x cf> in Fig. || for the same pythia generated 
event. 
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Figure 6. Calorimeter tower Et lego plot for a simu- 
lated large- Et jet event in the D0 Calorimeter. 




Figure 7. Et in cones centered on each calorimeter 
tower (in | 7 y tou ' er '| < 2.4) for the simulated large- Et jet 
event of Fig. [| 



The flow vector clearly points to the four potential 
jets noted above. Cones that are in the neighborhood 
of a potential jet exhibit flow vectors of large magni- 
tude pointing towards the jet center. This magnitude 
will generally be sufficient to cause the cone to fail the 
second test in Fig. |J thus preventing further iteration 
of the cone to define a proto-jet. The contours of Fig. || 
bound regions of flow with magnitude < 0.1 (solid con- 
tours) and < 0.05 (dashed contours) in r\ x </>, within 
which we expect to find the final jets. It is impor- 
tant to note the size of the detector regions with small 
flow magnitude. Regions with sufficiently small flow 
will pass test (2) in the clustering stage and allow the 
cone to undergo additional iterations. This ultimately 
increases processing time for clustering and complex- 
ity in splitting and merging (due to the production of 
many additional proto-jets). The flow magnitude cut 



Figure 8. Energy flow for the cones in the large- Et jet 
event of Figs. [j|-[7]. The contours bound flow regions 
with vector magnitude < 0.1 (solid contours) and < 
0.05 (dashed contours) in r\ x cj>. 



has a natural size on the order of the detector tower 
size. For the D0 detector, with a typical towers size 
of -q x <j) = 0.1 x 0.1, the cut would be between the 
two contours shown above. A too small magnitude cut 
will cause inefficiencies in jet finding; too large a cut 
will cause iterations on cones over the whole detector 
volume. 

It is clear from Figs. that the region of interest 
around the jets is much smaller than the area contained 
within the contours of "stable" cones. There are broad 
"plains" of low energy deposition where the flow vector 
is of small magnitude, but also of rapidly varying di- 
rection. Stable cones are found in these regions. But 
these presumably arise simply from local fluctuations 
yielding local extrema and are not expected to corre- 
spond to the fingerprints of underlying (energetic) par- 
tons. There are at least two, possibly parallel paths to 
follow in order to reduce the impact of these regions on 
the analysis, in terms of both required resources and 
final results. 

As already noted, we can further streamline the anal- 
ysis by applying the cut on the flow vector at each step 
in the iteration. Thus we keep only those cones that 
do not "flow" outside of their original tower before a 
stable center is reached. Such an algorithm converges 
rapidly to the stable cones pointed to by the largest 
magnitude flow vectors in Fig. || and efficiently elimi- 
nates most of the cones in the "plains" . We do lose the 
stable cones that a full iteration, allowing any amount 
of flow, finds in the flat regions of the previous figures. 
However, as already emphasized, these cones do not 
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Figure 9. A sample event from data. Tower Et lego 
plot for an event passing the D0 W — ► jets trigger. 



correspond to the physics we wish to study with jet 
analyses. With a large savings in analysis time the 
streamlined algorithm finds the same leading jet prop- 
erties {e.g., Et and r] J ) as the more complete algorithm 
to a fraction of a percent. The final jets contain typ- 
ically 120 to 160 towers. The differences between the 
leading jets found with the two algorithms arise from 
differences in tower content of just 1 or 2 towers (at 
the cone boundary). 

One can also reduce the effort and the final event 
complexity by applying a minimum Et cut on the 
cones at the proto-jet stage. An obvious choice for 
this minimum Et cut would be to place it above the 
level of detector noise. As alluded to in Section 3.3.1 



a practical cut might be placed slightly higher to re- 
duce sensitivity to varying event pileup with changes 
in beam luminosity. Unfortunately, this places a rather 
arbitrary threshold into the algorithm from the stand- 
point of theoretical calculations, i.e. what is the 'noise' 
level at NLO? Additionally, such cuts will in prac- 
tice be applied before final jet scale corrections. How 
does X GeV uncorrected in the experiment compare 
to X GeV at generator level? Such experiment specific 
considerations clearly are out of the realm of event gen- 
erator design! A possible improvement would be to set 
a minimum cone Et threshold equal to some fraction 
of the scalar Et in the event. In this way such effects 
will tend to partially cancel between generators and 
experiments, better relating the cut between the two 
levels. 

We next look at an example of the seedless algorithm 
tested on actual calorimeter data. Fig. || shows the 
tower Et lego plot for a D0 event passing a W — > jets 
trigger. The trigger required at least two central jets 



Figure 10. Et in cones centered on each calorimeter 
tower (in \rj T \ < 2.4) for the W — > jets sample event 
of Fig. §. 




Figure 11. Energy flow for the cones in the W — ► jets 
event of Figs. |9]-|To|. The contours bound flow regions 
with vector magnitude < 0.1 (solid contours) and < 
0.05 (dashed contours) in rj x <j>. 
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with Et > 15 GeV. These data were taken at high 
luminosity with an average of ~2.8 interactions per 
beam crossing. The two leading jets that pass the cut 
are reasonably obvious (along with, perhaps, two other 
subleading jets) but overall this event is clearly nois- 
ier (more realistic) than the PYTHIA generated event. 
This point is illustrated also in Figs. [II] and [0], which 
show the cone energy and flow vectors for this event, 
analogous to Figs. |?] and ||. In this case the base- 
line energy subtraction for calorimeter cell energies in 
the data leads to towers with (small) negative energy 
deposition. 

The increased level of noise and the possibility of 
negative tower energy results in two new issues for the 
jet analysis that were not observed in the analysis of 
the Monte Carlo data. The negative energy cells allow 
true stability with respect to the iteration process to 
be replaced by limit cycles. Iteration leads not only 
to cone center locations for which & — C' J =0 but 
also, for example, to doublets of locations for which 

0* = and = (?, or - (? = -(6* - C*). 
Thus continued iteration simply carries the cone center 
back and forth between location 1 (Ci) and location 2 

{(?). (More complex multiplets of locations with sets 
of 3, or even 6, 2-dimensional flow vectors summing 
to are also observed.) The good news is that these 
clusters of cone centers are typically close by each other 
and yield essentially the same final jets, after merging, 
independent of where in the limit cycle the iteration 
process is terminated. This is guaranteed to be true 
for the streamlined algorithm where the entire cycle 
must occur within a single tower. (The (rj x (j)) dis- 
tance between two members of such a limiting cycle 
driven by a negative tower energy of magnitude En is 
approximately R ■ En/Ec, where Eq is the total en- 
ergy in the cone. This can be as small as the minimum 
distance for a change of one tower in the cone as noted 
above, i.e., 7% of a tower width.) 

The noisy quality of the event leads to an even more 
troubling phenomenon. There are so many locally sta- 
ble cone centers found in the now rapidly fluctuating 
"plain" region that the proto-jet list may exhibit a sur- 
prisingly large number of mutually overlapping cones. 
During the merging phase these can coalesce into jets 
with large (even leading) Et- This issue has histori- 
cally been treated by applying a minimum Et cut to 
the proto-jet list before merging and splitting. With 
the event studied here a cut of 8 GeV (typical of val- 
ues used by D0) is not sufficient. If we keep all stable 
cones with Et > 8 GeV, with no other cuts, as proto- 
jets, the merging process builds a leading jet by pulling 
together many cones where there is clearly no real jet. 
This problem does not arise in the streamlined algo- 



rithm where only stable cones that stayed within their 
original tower are kept. In this case the algorithm 
identifies the leading jets anticipated intuitively from 
the above figures. 

3.3.5. Comments on the Seedless Clustering 

We may summarize the advantages of the seedless 
clustering described above as follows: 

1. Avoids undesirable sensitivity to soft and 
collinear radiation. 

2. Offers increased efficiency for all physically inter- 
esting jets. 

3. Offers improved treatment of limit cycles and 
overlapping cones. 

4. "Flow cut" method offers more efficient use of 
computer resources than unrestricted seedless 
clustering. 

We have not investigated further improvements in 
the optimization of the computational efficiency for 
this seedless algorithm. However, some improvement 
may be gained by using the fact that cones centered 
on adjacent towers are largely overlapping, thus reduc- 
ing the number of towers to sum for each new center. 
Other improvements such as region of interest (ROI) 
clustering may also be explored. 

3.4. Cone Jets with Seeds 

In an actual experiment the number of calorime- 
ter towers may be very large (order 6000 for tower 
sizes of Ai] x A0 = 0.1 x 0.1 and an r) coverage of 
±5 units of pseudorapidity) . The above seedless al- 
gorithm may then be expensive computationally. The 
question arises whether an acceptable approximation 
of the seedless algorithm can be constructed, analo- 
gous to the parton-level short cut, while considering 
primarily those towers which have energy depositions 
above a minimal seed threshold for finding proto-jets. 

Seed-based cone algorithms offer the advantage of 
being comparatively efficient in CPU time. In a typ- 
ical application, detector towers are sorted according 
to descending Et and only towers passing a seed cut, 

E tower > E seed ^ ^ 

are used as starting points for the initial jet cones. 
This greatly reduces the number of cones that need to 
be evaluated in the initial stage. The seed threshold 
jrjseed mus ^ ^ e c hosen low enough so that variations of 
E T eed lead to negligible variations in any observable un- 
der consideration. The simple seed-based algorithm is 
sensitive to both infrared or collinear effects. However, 
sensitivity to the splitting of the seed Et between mul- 
tiple towers is greatly reduced for larger Et jets. As 
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stated above, this is true when the jet reconstruction 
becomes 100% efficient (i.e., around 20GeV for jets in 
D0). For fully efficient jet algorithms the collinear de- 
pendency is reduced to a second-order effect, namely, 
the effective number of low Et proto-jets that may en- 
gage in splitting and merging. In a typical algorithm a 
minimum Et cut may also be applied to each proto-jet 
to prevent excessive merging of noise and energy not 
associated with the hard scattering producing the jets. 

3.4.1. Addition of Midpoints 

The seedless algorithm discussed previously can be 
approximated by a seed-based algorithm with the ad- 
dition of 'midpoints' in the list of starting seeds. The 
idea [ [l5|] is to duplicate the parton- level algorithm dis- 
cussed in Section 3.3.3| , but with partons replaced by 
seeds. By adding a starting point for clustering at the 
positions given by pi +Pj, pi +Pj +Pk etc., the sensitiv- 
ity of the algorithm to soft radiation as illustrated in 
Fig. [I] is essentially removed. Since widely separated 
seeds cannot be clustered to a proto-jet, it is sufficient 
to only consider those midpoints where all seeds lie 
within a distance 



AR < 2.0 ■ R c 



(23) 



of each other. 

With these changes, the resulting algorithm is quite 
close to those used in Run I of the Tevatron. The 
main change is the inclusion of midpoints of seeds (the 
Pi + pj pairs) and of centers of larger numbers of seeds 
as additional seed locations for trial cones. Two stud- 
ies of the effects of adding midpoints were completed 
during the workshop and are summarized below. The 
first checks the infrared safety of the midpoint algo- 
rithm, also called the Improved Legacy Cone Algo- 
rithm (ILCA), in a Monte Carlo study. The second 
tests the effect of adding midpoints on the performance 
of the Run I D0 cone algorithm. 

3.4.2. Results from a Monte Carlo Study 

The request for an infrared and collinear safe jet- 
algorithm is most important from the viewpoint of per- 
turbative QCD calculations. Unsafe algorithms simply 
do not permit unambiguous results, once higher order 
corrections are considered [ [l6| \17\ . Instead results 
will depend on the technical rcgularization procedure 
adopted in a specific calculation. 

The deficiencies of an unsafe algorithm will only 
show up at sufficiently high order in the perturbative 
expansion. For example, the jet merging due to soft 
gluon radiation as depicted in Fig. ^ will only become 
a problem when three partons or more can be com- 
bined to a single jet. In hadron collider processes this 
first happens in, for example, the NLO corrections to 
three-jet production [ p), where four-parton final states 



are included in the real emission contributions. The 
fourth parton is needed to provide the necessary re- 
coil transverse momentum to the other three partons 
which may or may not form a single jet. The NLO 
three-jet Monte Carlo is very CPU intensive, however, 
making it a cumbersome tool to investigate jet algo- 
rithms, at present. A much faster probe is provided by 
the existing NLO dijet Monte Carlos in DIS [0,0- 

In ep — > ejjX, the electron provides the necessary 
recoil pr to the final-state partons. The real emission 
QCD corrections at 0(a 2 ) thus contain three partons 
which can be close together. Their merging to a single 
jet, with the concomitant loss of two-jet cross section, 
is a probe of the infrared safety of the two-jet vs. one- 
jet classification of partonic events. A second probe is 
provided by the Et flow inside a jet, which has recently 
been modeled with up to three partons in a single jet, 
for the current jets in DIS [ |l8). 

We have investigated these issues with the mepjet 
Monte Carlo [ [To), which calculates dijet production 
in DIS at NLO. The program was run in a kinematical 
range typical for HERA, ep collisions at s/s = 300 GeV 
with Q 2 > 100 GeV 2 . Reconstructed jets were required 
to satisfy 



E T > 10 GeV, 



-l<y<2, Rjj < 2, 



(24) 



where E-scheme recombination is used. Here Rjj is the 
separation of reconstructed jets in the legoplot. Follow- 
ing HERA practice, we use a cone size R = 1 . Consid- 
ering jets with a maximal separation of twice the cone 
size enhances the statistical significance of any split- 
ting/merging effects in the Monte Carlo calculation. 

With these settings two cone algorithms are con- 
sidered to investigate the importance of extra mid- 
points in the perturbative results. The first is the 
seedless algorithm in its parton-level implementation 



as described in Section 3.3.3, which wc here call the 



"midpoint" algorithm. In order to test the analog of 
tower threshold effects, only partons with Er,i > E T eed 
are considered for centers of trial cones, i.e., trial cone 
centroids are the directions of these partons and their 
midpoints pi + pj and pi + pj + The second al- 
gorithm, dubbed "no center seed" is identical, except 
that the midpoints are left out as trial cone centers. 
For both algorithms, the final splitting/merging deci- 
sion is made with an £>p-fraction of / = 0.75 of the 
lower Et proto-jet as the dividing line. 

The mepjet program is based on the phase space 
slicing method, with a parameter s m i n defining the 
separation between three-parton final states on the 
one hand, and the virtual contributions plus soft and 
collinear real emission processes (which cancel the di- 
vergences of the virtual graphs) on the other. This 
dividing line is completely arbitrary and observables 
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Figure 12. Dependence of the DIS dijet cross section 
on s m in for the ILCA algorithm with midpoints (plain 
symbols) and for the "no center seed" algorithm (dia- 
monds). 



should not depend on it. A test of this require- 
ment is shown in Fig. [l2] where the dijet cross sec- 
tion within the cuts of Eq. ^ is shown as a func- 
tion of s m i n . Whereas the midpoint algorithm shows 
Smm-hidependence within the statistical errors of the 
Monte Carlo (plain symbols), leaving out the mid- 
points between partons leads to a pronounced decrease 
of the cross section as s m in becomes smaller. Smaller 
Smin implies that more events are generated as explicit 
three-parton hnal states. The additional soft gluons 
act as extra seeds that tend to merge the two jets, 
leaving the event classified as a one-jet event, which 
does not contribute to the plotted dijet cross section. 
The Smin dependence of the "no center seed" algorithm 
means that no perturbative prediction is possible for 
this algorithm: as s m j„ approaches zero, the dijet cross 
section diverges logarithmically as log s m in/Q 2 - 

Even when fixing s m in to some typical soft QCD 
scale, like s m in — 0.03 GeV 2 , the "no center seed" 
algorithm has fatal defects. This is demonstrated in 
Fig. [l3| where the variation of the dijet cross section 
within the cuts of Eq. ||J is shown as a function of 
"tower threshold" transverse energy E T eed . The mid- 
point algorithm is almost independent of this thresh- 



old, as long as E T eed is less than about 10% of the jet 
transverse energy. The "no center seed" algorithm, on 
the other hand, shows a pronounced threshold depen- 
dence, raising the specter of substantial dependence 
of jet cross sections on detector thresholds, detector 
response to soft particles and nonperturbative effects. 
These effects have been discussed previously for three- 
jet events at the Tevatron [ p[ p^j. 
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Figure 13. Dependence of the DIS dijet cross section 

Results are 



on the seed threshold E T eed of Eq. 
shown for ILCA, with midpoints (plain symbols) and 
for a "no center seed" variant (diamonds). 



Discarding the "no center seed" algorithm we turn to 
internal Et flow inside a single jet as another measure 
of the performance of jet algorithms. The differential 
jet shape, p{r), is defined as 1/Ar times the average 
Et fraction of a jet in a narrow ring of width Ar, a 
distance r from the jet axis. In Fig. [li] the differential 
jet shape is shown for current jets at HERA, in the 
phase space region 



E T > 14 GeV 



-1< T) < 2 



(25) 



for DIS events with Q 2 > 100 GeV 2 . Results are shown 
for the midpoint (ILCA) and the Kt algorithm (to be 
described later) at NLO (0(a 2 s )). The midpoint algo- 
rithm produces wider jets than the Kt algorithm with 
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D = R, as is to be expected since two partons with a 
separation slightly less than 2R can be clustered by the 
midpoint, but not the Kt algorithm. NLO corrections 
are quite small for the midpoint algorithm. We have 
also checked that the jet shapes in the midpoint algo- 
rithm exhibit good scale dependence at NLO, similar 
to the K T algorithm [ p). 



initially found around all seed towers above a 1 GeV 
threshold, then around all midpoints. In the second 
case they were first found around all midpoints be- 
tween seed towers, then around the seed towers them- 
selves. Fig. |l5] shows the Et distributions for three 
trials, the legacy seed, seed + midpoint, and midpoint 
+ seed trials. Also shown are the ratios of the Et 
spectra. A cone radius of 0.7 was used. 
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Figure 14. Jets shapes in ILCA (dashed line) compared 
to Kt (solid line). 



3.4.3. Results from Data Study 

A midpoint algorithm has previously been employed 
by the OPAL Collaboration 



19 



We now report a 
study performed using the D0 data. The data were 
acquired from a two-jet trigger sample with an average 
of 2.8 interactions per beam crossing. The goal of the 
data-based study was to test the sensitivity of DO's 
Run I cone algorithm to the addition of midpoints. 
To facilitate a direct comparison of Run II jet results 
with the current data it is desirable that algorithms 
supported^] for the new data produce similar results. 

Details in the D0 Run I jet algorithm forced the 
splitting and merging of jets to occur as they are found. 
In effect this defines an order dependence based on the 
seed Et of the jets. It was possible to test two order- 
ings in the jet clustering. In the first case, jets were 

4 While any number of jet algorithms may in principle be in- 
cluded in an offline analysis stream, in practice only a few algo- 
rithms will typically be fully supported by detailed energy scale, 
resolution, and efficiency corrections. 



Figure 15. Jet Et distributions and ratios. Top: Jet 
Et distributions for the three algorithms overlayed. 
Legacy seeds (large circles), seeds + midpoints (stars), 
midpoints + seeds (small circles). Middle: Seeds + 
midpoint distribution divided by the legacy distribu- 
tion. Bottom: Midpoint + seeds distribution divided 
by the legacy distribution. 



There are two effects to observe in Fig. |l9. First, the 
addition of midpoints tends to cause an increase in the 
number of low Et jets. This is because the midpoints 
are effectively zero threshold seeds, therefore very soft 
jets that tend to fail reconstruction by falling short of 
the seed requirement may sometimes be reconstructed 
around a midpoint. Second, the results are different 
depending on the order in which the seeds + midpoints 
are used. However, we can safely conclude that the ad- 
dition of midpoints has little more than a few percent 
effect on the experimental jet Et distribution. 

Fig. [l6| shows the ratio of the leading jet for the 
legacy seed and midpoint + seed algorithms. Since 
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a meaningful test requires the comparison of the same 
jets, the jets were also required to be matched within a 
radius of 0.2 (in A 77 x A<f>) to prevent accidental com- 
parisons of unrelated jets due to 'flipping' of the jet 
order between algorithms. Fig. [I?] shows the fractions 
of isolated, merged, split, and multiply split/merged 
jets for the legacy seed and midpoint + seed algo- 
rithms. In each case only small variations arc observed 
between the two algorithms, indicating that a legacy 
cone algorithm augmented by midpoints is an accept- 
able choice for comparisons to Run I physics results. 
In fact, Figs. [IB] and |l6| represent extreme deviations in 
jet Et, since Et differences are expected to be reduced 
after application of jet energy corrections appropriate 
to each algorithm. 

3.5. Proposals for Common Run II Cone Jet 
Algorithms 

The cone algorithm starts with a cone defined in E- 
scheme variables as 



* c C 



< R. 



(26) 



where for massless towers, particles, or partons y l = if. 
The E-scheme centroid corresponding to this cone is 
given by 
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A jet arises from a "stable" cone, for which y c 



y 



y and <j> 
properties 



and the jet has kinematic 
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= (E J ,p J ) = 



E 

iCJ=C 



(E\p x ,pi,pi) , 



V 



(pi) 2 

1 , E J 
— In — t 

2 E J 



(P j y) 2 

■pi 



tan 



Pz 



-xPl_ 

pi 



(29) 
(30) 
(31) 



Seedless algorithm. For a seedless algorithm we rec- 
ommend the streamlined jet algorithm defined in Sec- 
tion 3.3.1 that includes the flow cut for computational 
efficiency improvement and reduction of soft proto-jet 
construction. The clustering or jet finding should be 
done in terms of E-scheme variables. 

Seed-based algorithm or ILCA. Backwards compat- 
ibility is important here as well as common specifica- 
tions between experiments. For the Run II algorithm 
we recommend that jet clustering commence on each 
seed tower (rather than consolidated seeds as in Run I) , 
for simplicity of the algorithm and to reduce depen- 
dencies on detector segmentation. Since the finding 
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Figure 16. Et ratios for leading jets. The ratio 
of leading jet Et in the midpoint algorithm is 
plotted as a function of the legacy cone jet's Et- 
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Figure 17. A view of splitting and merging frac- 
tions in the legacy seed (solid) and midpoint + 
seed algorithms (dotted). 
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Generate midpoint 
list from protojets 
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Figure 18. Method for addition of midpoints. 



of proto-jets is determined by the seed threshold, it is 
reasonable to determine the midpoints based on the 
positions of the proto-jets rather than the seed list it- 
self, as illustrated in Fig. |l8|. This would reduce the 
number of midpoints to be calculated due to the large 
combinatorics caused by adjacent seed towers within 
jet cones. 

Specifications Summary We list here the precise 
specifications of the jet algorithms and variables: 

1. ^cone- O.T 

2. p s T eed : 1.0 GeV 

3. Recombination: E-schemc 

4. Midpoints: Added after cone clustering 

5. Split/Merge: pr ordered, threshold = 50% of 
lower pt jet 

6. Reported kinematic variables: E-scheme, ei- 
ther directly as (E J ,p J ) or as (m J ,pt j ,y J ,4> J ), 



where 



the mass of the jet (m J 



E^ 



4. Kt Jet Algorithms 



4.1. Introduction 

This section provides a guide for the definition of 
Kt jet algorithms for the Tevatron. Section 4.2 de- 
scribes the recommended algorithm in detail. Sec- 



tion 4.3 discusses preclustering of particles, cells, or 
towers for both the CDF and D0 experiments. Sec- 



tions 4.4 and 4.5 outline momentum calibration of the 
Kt algorithm and briefly describe jet resolution. Fi- 



nally, in Section 4.6, we provide a few examples of the 



versatility of the Kt algorithm. 

4.2. The Run II K T Algorithm 

In this section we propose a standard Kt jet algo- 
rithm for Run II at the Fermilab Tevatron. This pro- 
posal, based on studies of the Kt algorithm by several 
groups [ Hi], p2|, establishes a common algorithm 
that satisfies the general criteria presented in Section 
1. 

The Kt jet algorithm starts with a list of preclusters 
which are formed from calorimeter cells, particles, or 
partons.H Initially, each precluster is assigned a vector 



(E, p) = E ( 1 , cos <fi sin 9, sin <f> sin 9, cos ( 



(32) 



where E is the energy associated with the precluster, 
4> is the azimuthal angle, and 9 is the polar angle with 
respect to the beam axis. For each precluster, we calcu- 
late the square of the transverse momentum, p T , using 



2 2,2 

Pt =P x +P y 



and the rapidity, y, using^] 

Pz 



y 



2 E- Pz 



(33) 



(34) 



A flowchart of the Kt algorithm is shown in Fig. [19[ 
Starting with a list of preclusters and an empty list of 
jets, the steps of the algorithm are as follows: 



1. For each precluster i in the list, define 



: Pt,, 



(35) 



For each pair (i,j) of preclusters [i ^ j), define 



dij = min (p T i ,p T . 



^(pt^Ptj) 



/4 

S) D 2 

{Vi -Vj? 



D 1 



■(36) 



where D « 1 is a parameter of the jet algorithm. 
For D — 1 and AlZij -C 1, dij is the minimal 
relative transverse momentum k± (squared) of 
one vector with respect to the other. 



5 Preclustering is discussed in detail in Section 4.3 
6 To avoid differences in the behavior of the algorithm due to 
computational precision when \y\ is large, we assign y 

\y\ > io. 



±10 if 
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2. Find the minimum of all the di and dij and label 
it d m i n . 

3. If d m in is a dij, remove preclusters i and j from 
the list and replace them with a new, merged 
precluster (Eij,Pij) given by 



Eij 
Pij 



(37) 
(38) 



Start with a list 
of preclusters 



For each precluster, calculate 

di = pf-j 

For each pair of preclusters, calculate 
4, minW,,,^)* »'•,;<* *<* 



4. If d m in is a di, the corresponding precluster i 
is "not mergable." Remove it from the list of 
preclusters and add it to the list of jets. 

5. If any preclusters remain, go to step 1. 

The algorithm produces a list of jets, each separated 
by A1Z > D. Fig. illustrates how the Kt algorithm 
successively merges the preclusters in a simplified dia- 
gram of a hadron collision. 

The Kt algorithm presented above is based on sev- 
eral slightly different Kt jet clustering algorithms for 
hadron colliders [ |2(], ^l], p2j. The main differences 
have to do with (1) the recombination scheme and (2) 
the method of terminating the clustering. The choices 
in the proposal above are discussed in the following 
paragraphs. 

The recombination scheme was investigated by 
Catani et al. [ pTj|. We elect to use the covariant E- 
scheme (Eqs. |37|-|38|), which corresponds to vector ad- 
dition of four-momenta, because our goals are 

1. conceptual simplicity, 

2. correspondence to the scheme used in the Kt 
algorithm for e + e~ collisions [12^, 

3. absence of an energy defect [|24[], an d 

4. optimum suitability for the calibration method 



Identify d mjn: the minimum 
of all the cL and du 



described in Section 4.4. [ 



The prescription of Catani, et al. [pC, 21 introduces 
a stopping parameter, d cu t, that defines the hard scale 
of the physics process and separates the event into a 
hard scattering part and a low-pT part ("beam jets"). 
Catani et al. suggest two ways to use the d cu t parame- 
ter. First, d cu t can be set to a constant value a priori, 
and when dmin > d cut the algorithm stops. At this 
point, all previously identified jets with p T < d cu t are 
classified as beam jets, and all remaining preclusters 
with p T i > d cut are retained as hard final-state jets. 
Alternatively, an effective d cu t can be identified on an 
event-by-event basis so that clustering continues until 
a given number of final-state jets are reconstructed. 

Unlike Catani, et al., the algorithm proposed by Ellis 
and Soper [ E2| continues to merge preclusters until all 



Remove preclusters i and j 
and replace them with a new, 
merged precluster 




Remove precluster i from 
the list of preclusters and 
add it to the list of jets 




Figure 19. The Kt jet algorithm. 



jets are separated by A1Z > D. We have adopted this 
choice. Besides its simplicity, this method maintains 
a similarity with cone algorithms in hadron collisions. 
Whereas the use of d cu t is well suited for defining an 
exclusive jet cross section (typical of e + e~ collisions), 
we desire an algorithm that defines inclusive jet cross 
sections in terms of a single angular resolution param- 
eter D, which is similar to R for cone algorithms. 

4.3. Preclustering 

As described in the previous section, the input to 
the Kt jet algorithm is a list of vectors, or preclus- 
ters. Ideally, one should be able to apply the Kt 
algorithm equally at the parton, particle, and detec- 
tor levels, with no dependence on detector cell type, 
number of cells, or size. The goal of preclustering 
is to strive for order independence and detector in- 
dependence by employing well-defined procedures to 
remove (or reduce) the detector-dependent aspects of 
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ters all exhibit positive energy. Candidate schemes to 
achieve these goals are described in detail in the follow- 
ing sections. However, it is important that the preclus- 
tering scheme does not introduce the sort of problems 
with infrared or collinear sensitivities that we earlier 
discussed for the case of seeds. 
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con- 



4.3.1. CDF Preclustering 

The CDF calorimeter system for Run II 
sists of 1,536 towers. Each tower consists of an elec- 
tromagnetic (EM) component and a hadronic (HAD) 
component. In order to form preclusters for input to 
the Kt algorithm, we propose the following: 

1. Measure the amount of EM energy deposited into 
each calorimeter tower, Eem, and form the vec- 
tor (E E m,Pem) where 



Px,em = E E m cos <j) sin 9 E m , 
Py.EM = E E m sin tf> sin 9 E m , 
Pz,em — Eem cos 9 em ■ 



(39) 
(40) 
(41) 



Figure 20. A simplified example of the final state of a 
hadron collision. The open arrows represent preclus- 
ters in the event, and the solid arrows represent the 
final jets reconstructed by the Kt algorithm. The six 
diagrams show successive iterations of the algorithm. 
In each diagram, either a jet is defined (when it is well 
separated from all other preclusters), or two preclus- 
ters are merged (when they have small relative k±_). 
The asterisk labels the relevant precluster(s) at each 
step. 



jet clustering. Practically, however, this independence 
is very difficult to achieve. For example, if a single 
particle strikes the boundary between two calorimeter 
towers, two clusters of energy may be measured. Con- 
versely, two collinear particles may shower in a single 
calorimeter tower so that only one vector is measured 
experimentally. Preclustering all vectors within a ra- 
dius larger than the calorimeter tower size removes this 
problem. 

At the parton and particle levels, the simplest pos- 
sible preclustering scheme is to identify each parton 
or particle four-vector as a precluster. Experimen- 
tally, differences between the geometries of the CDF 
and D0 calorimeters necessitate different preclustering 
schemes. In particular, the D0 discussion describes 
how the preclustering scheme can be used to control 
the number of preclusters passed to the Kt algorithm 
in order to keep the jet analysis computationally fea- 
sible. It can also be used to ensure that the preclus- 



Likewise, measure the amount of HAD energy de- 
posited into each calorimeter tower, Eh ad, and 
form the vector (Eh ad, Phad) where 

Px.had = Eh ad cos cj> sin 9 had , (42) 
Pv.had = E H ad sin (f> sin 9 had , (43) 
Pz.had = Eh ad cos 9 had ■ (44) 

The angles 9 em, Oh ad and <j> specify the posi- 
tion of the calorimeter tower components with 
respect to the interaction point. Note that 9 em 
and 9had may take on slightly different values 
when calculated using different interaction points 
along the beam axis (see Fig. |2l| ). 

2. For each calorimeter tower, calculate a vector 
(E, p) by summing the vectors for the EM and 
HAD components: 

(E, p) = (E E m + E H ad, Pem + Phad) (45) 

3. For each calorimeter tower, calculate the pt from 
its associated vector using 

Pt = \JpI+pI 

= E E m sin 9 em + E H ad sin 9 had ■ (46) 

4. Assemble a list of tower vectors for which 

PT > pT n , (47) 
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where p^ m w 100 MeV.[| These are the preclus- 
ters for the Kt algorithm. 

In designing the CDF preclustering scheme, the pri- 
mary goal was simplicity. We made every attempt 
to maintain a close relationship between the physical 
calorimeter towers and the input preclusters for the 
Kt algorithm. 




Figure 21. Schematic of a single CDF calorimeter 
tower. 



4.3.2. D0 Preclustering 

The Kt jet algorithm is an C(n 3 ) algorithm, where 

Limited 



n is the number of vectors in the event [ 20 



computer processing time does not allow this algorithm 
to run on the ~ 45000 cells or even the ~ 6000 towers of 
the D0 calorimeter. Therefore, we employ a preclus- 
tering algorithm to reduce the number of vectors input 
to the algorithm. Essentially, towers are merged if they 
are close together in 77 x (j) space, or if they have small 
Pt (or negative pr, as explained below). The preclus- 
tering algorithm described below was used by the D0 
experiment in Run I. We examine the effects of the Run 
I preclustering algorithm, and discuss possible alterna- 
tives for Run II. Although the effects of preclustering 
on jet observables should be small, this is analysis and 
detector dependent. A Monte Carlo study of preclus- 
tering effects on the jet pt and on jet structure is also 
presented. 

In Run I, one use of preclustering was to account 
for negative energy calorimeter towers [ [27j] which can 
cause difficulties for the Kt algorithm. In the D0 



7 This pt cut is designed to retain towers with energies well above 
the level of electronic noise. The exact value for this pt cut will 
depend on measurements of calorimeter performance. 



calorimeter, we measured the difference in voltage be- 
tween two readings (peak minus base) , as illustrated in 
Fig. |2^. To first order, this online baseline subtraction 
technique removes the effect of luminosity-dependent 
noise in the calorimeter, on a tower- by-tower basis. 
Residual fluctuations in each reading, however, some- 
times lead to measured energies which are negative. 
One can imagine at least four ways to deal with these 
negative energy towers. 

1. Absorb the negative energy into a precluster of 
towers such that the overall precluster energy is 
positive, as will be discussed here. 

2. Add an offset to all tower energies so that there 
are none with negative energy. The offset could 
then be removed later in the analysis. 



3. Ignore all towers with negative energy, i.e., 
move them from the jet analysis. 



rc- 



4. Proceed with the Kt algorithm analysis includ- 
ing the negative energy towers, assuming that 
their impact is negligible. Recall that in the 
cone algorithm case the negative energy towers 
are the source of the observed limit cycles for 
quasi-stable cones, which does not seem to be a 
serious problem. 

Clearly, further studies of this issue are required. The 
precluster scheme can also be used to absorb low pt 
towers similarly to what is done for negative energy 
towers. 

The Run I preclustering algorithm, which is em- 
ployed in the following studies, has six steps: 

1. Identify each calorimeter cell with a 4- vector 
(E,p) = E (1, cos <f>sm0, sin^sin 9, cosd) where 
E is the measured energy in the cell. For each 
cell, define 



Pt 



p\ = E sin ( 



and 



In tan • 



(48) 



(49) 



Remove any calorimeter cells with pt < —500 
MeV. Cells with slightly negative pt are allowed 
due to pileup effects in the calorimeter, but cells 
with highly negative pt are very rarely observed 
in minimum bias events and are thus considered 
spurious, so they are removed. 
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tb t p t 

Figure 22. Schematic of voltage in a calorimeter 
cell as a function of time. The solid line shows the 
contribution for a given event (the current cross- 
ing). The cell is sampled once at tb, just before 
a pp bunch crossing, to establish a base voltage. 
The voltage rises during the time it takes electrons 
to drift in the liquid argon gap (^500 ns), and 
reaches a peak value at t p ss 2 fjs, which is set by 
pulse-shaping amplifiers in the signal path. The 
cell is sampled again at t p , and the voltage differ- 
ence AV — V(t p ) — V(tb) is proportional to the 
raw energy in the cell. Because the decay time of 
the signal t w 30 ps is much larger than the ac- 
celerator bunch crossing time t x = 3.5 fxs, V(tb) 
may have a contribution from a previous bunch 
crossing. The size of this contribution is related to 
the number of pp interactions in the previous cross- 
ing, which depends on the beam luminosity. The 
dashed lines show an example contribution from a 
previous bunch crossing containing three different 
numbers of interactions. The figure is not drawn 
to scale. 

3. For each calorimeter tower, sum the transverse 
momenta of cells within that tower: 



tower 

Pt 



E 



Pt 



ell 



(50) 



cell £ tower 



4. Merge towers if they are close together in -q x <\> 
space: 

(a) Form an 77-ordered (from most negative to 
most positive) list of towers; towers with 
equal r\ are ordered from <fi = to <f> — 27r. 

(b) Remove the first tower in the list and call it 
a precluster. 

(c) From the remainder of the list, find the clos- 
est tower to the precluster. 

(d) If they are within AK P = A-q 2 + Acj) 2 = 
0.2, remove the closest tower from the list, 



and combine it with the existing precluster, 
forming a new precluster; go to |^]. 

(e) If any towers remain, go to |4b|. 

5. Preclusters which have negative transverse mo- 
mentum pt = Pt- < are redistributed to 
neighboring preclusters. Given a negative pt 
precluster with (pr-, ?7-, <fi—), we define a search 
square of size (?y_±0.1) x (</!>_ ±0.1). If the vector 
sum of positive pt in the search square is greater 
than |pt-|, then pr- is redistributed to the pos- 
itive pt preclusters in the search square. Oth- 
erwise, the search square is increased in steps of 
A?/ = ±0.1 and Acj) = ±0.1, and redistribution is 
again attempted. If redistribution still fails with 
a search square of size (r)- ± 0.7) x (0_ ± 0.7), 
the pt of the negative momentum precluster is 
set to zero. 

6. Preclusters which have pt < p^ = 200 MeV 
are redistributed to neighboring preclusters, as 
in step 5. We make the additional requirement 
that the search square have at least three pos- 
itive pt preclusters, to reduce the overall num- 
ber of preclusters. The threshold p v T was tuned 
to produce ^200 preclusters/event, as shown in 
Fig. [2ij, to fit processing time constraints. Next, 
jets are reconstructed from the preclusters. 

In steps 4-6, the combination followed a Snowmass 
style prescription: 



Pt 
V 



Pr,i + Pt.j , 

PT,iT]i +PT,jT)j 



PT,i 
PT,i<t>i 



' PT,j 
-pT,j<Pj 



PT,i + PTJ 



(51) 
(52) 

(53) 



As a minimal change to the Run I preclustering algo- 
rithm, a possible Run II preclustering proposal should 
instead use vector addition of four-momenta. The Run 
II preclustering algorithm should also use y (as defined 
by Eq. |34]) instead of 77 and a true 2-vector pt rather 
than the scalar px of Eq. [5l]. Generally, the definitions 
of variables and recombination scheme in the preclus- 
tering algorithm should match the choices used in the 
proposed Kt jet algorithm. All of the results presented 
here used the Run I preclustering algorithm. 

The preclustering radius ATZ p in step 4 of the algo- 
rithm above can be used to test the sensitivity of jets to 
the calorimeter segmentation size, A<j> x A77 = 0.1 x 0.1 
(or smaller) in the D0 calorimeter. Preclustering with 
ATZ P = 0.2 > At] or A(f> in step 4 of the algorithm 
mimics a coarser calorimeter. This effect was studied 
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Minimum E T (MeV) 

Figure 23. The number of preclusters per event, 
as a function of minimum precluster transverse en- 
ergy Elj,. The D0 data were preclustered with 
the choice E V T = 200 MeV, which produced -200 
preclusters per event. With the preclusters treated 
as massless, Et is the same as px- This iden- 
tification is certainly appropriate for individual 
calorimeter towers. 
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in a sample of herwig Monte Carlo QCD jet events. 
The jets in the hard 2 — > 2 scattering were generated 
with pr > 50 GeV, and at least one of the two leading 
order partons was required to be central (\rj\ < 0.9). 
The events were passed through a full simulation (in- 
cluding luminosity C ~ 5 x 10 30 cm _2 s _1 ) of the D0 
detecto r. The MC sample is described in more detail in 
Section 4.4.1 . Fig. |24| shows the number of preclusters 
with AK P = 0.2 is -180, reduced by 37% from that 
obtained with A1Z P — 0. Fig. p5| shows that precluster- 
ing is necessary even at the particle level in the Monte 
Carlo, reducing the number preclusters by 24%. Com- 
paring Figs. |24| and p5|, the number of preclusters in the 
detailed detector simulation is a factor 2.4 higher than 
at the particle level for A1Z P = 0. Most of the addi- 
tional preclusters are reconstructed near the beampipe 
and some are due to localized deposits of low energy. 
With AlZp = 0.2, the number of preclusters increases 
only by a factor 2.0. 

The effect of the preclustering radius A1Z P on jets 
and jet structure was examined next. Fig. |2^ shows 
the comparison of the leading jet px with A7Z P = 0.2 
to that with ATZ P — 0. The jets were reconstructed 
with the Kt jet algorithm D = 0.5. The preclustering 
radius A1Z P = 0.2 (step 4 of the preclustering algo- 
rithm) reduces the mean jet pt by 0.7 GeV. Evidently, 
the preclustering algorithm assigns energy differently 
than the Kt algorithm. It is difficult to track exactly 
which towers end up in each jet, in part because of the 
redistribution of energy in steps 5 and 6 of the preclus- 
tering algorithm. The net effect is that some energy 
belonging to the leading jet when A1Z P = is trans- 
ferred to non-leading jets when A1Z P = 0.2. The shift 
in the leading jet pt spectrum is visible in the top panel 



Figure 24. Distribution of the number of preclus- 
ters per event, with A1Z P = 0.2 (solid), and with 
A1Z P — (dash). Taken from a sample of QCD jet 
events from MC data. The jets were reconstructed us- 
ing the calorimeter simulation, including the luminos- 
ity simulation. The preclustering radius A1Z P = 0.2 
reduces the mean number of preclusters per event by 
37%. 



of Fig. |26|, and the ratio in the bottom panel suggests 
some dependence on the jet px- Such a shift may need 
to be corrected for in the Run II experimental data, 
but will be different due to the change in calorimeter 
electronics. In Run I, a correction was not explicitly 
applied to the experimental data for this effect. In- 
stead, the theoretical predictions included the identi- 
cal preclustering algorithm used in experimental data. 
Fortunately, the particle-level result for leading jet pr 
is not as sensitive to A1Z P . This is shown in Fig. |27j. 
Note that even the particles in the Monte Carlo were 
projected into a calorimeter-like grid (Ar/> x Aij = 0.1 
x 0.1) by the preclustering algorithm. If this were not 
the case, then we would expect an even larger effect 
than illustrated in Fig. |27]. 

The jet structure, however, is more sensitive to the 
preclustering radius ATZ p . Fig. ^ shows the average 
subjet multiplicity, as a function of y cut (see Section 
4.6.1 ), in particle-level jets. There are more subjets in 
jets when A1Z P = 0, compared to when A1Z P = 0.2. 
Requiring preclusters to be separated by A1Z P affects 
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Figure 25. Same as in Fig. gj, except the jets were 
reconstructed in MC data at the particle level, with 
no calorimeter or luminosity simulation. The same 
preclustering radius A1Z P = 0.2 reduces the mean num- 
ber of preclusters per event by 24% 



the subjet structure below 
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(54) 



Again, the subjet multiplicity is increased even further 
when particles in the Monte Carlo are not projected 
into a calorimeter-like grid (A(f> x Arj = 0.1 x 0.1). 
This underscores the fact that the same preclustering 
algorithm, as well as the same jet algorithm, must be 
used in any comparisons of theoretical predictions to 
experimental data which are sensitive to internal jet 
structure at the level of the detector granularity. 

4.4. Momentum Calibration of Kt Jets at D0 

Jet production is the dominant process in pp col- 
lisions at ^/s = 1.8 TeV, and almost every physics 
measurement at the Tevatron involves events with jets. 
A precise calibration of measured jet momentum and 
energy, therefore, is of fundamental importance. Al- 
though the use of a Kt algorithm is well defined the- 
oretically, questions have recently arisen regarding the 
performance of the algorithm in a high luminosity 
hadron collider environment. 

The D0 Collaboration developed a method to cal- 



Figure 26. The top panel shows the distribution of 
the leading jet pr with ATZ P — 0.2 (solid), and with 
AlZp — (dash). Measured in a sample of QCD 
jet events from MC data. The sample was generated 
with minimum parton transverse momentum p™ m = 
50 GeV. The Kt jets were reconstructed with D = 0.5 
in the calorimeter simulation, including the luminosity 
simulation. The preclustering radius ATZ P = 0.2 re- 
duces the mean of the leading jet pr by 0.7 GeV. The 
bottom panel shows the ratio of the histograms in the 
top panel. 



ibrate Kt jets to a high level of accuracy. The de- 
tails are discussed thoroughly in Ref. [ |2|, p9|| . Here, 
we briefly summarize this work by the D0 Collabo- 
ration to illustrate instrumentation effects on the Kt 
algorithm, as well as its behavior in a high luminosity 
hadron collider. The Kt jets momentum scale correc- 
tion is largely based on the calibration of cone jets, 
extensively discussed in a recent article [ P7| . The 
derivation of the momentum scale correction is per- 
formed for Kt jets with D = 1. The measured jet mo- 
mentum, Pjll as , is corrected to that of the final-state 
using the following relation: 



particle-level jet, p^lf , 



ptcl 
Pjet 



P 



j 1 L 



Po 



j<i 



(55) 



where po denotes a momentum offset correction for un- 
derlying event, uranium noise, pile-up, and additional 
pp interactions. Rj e t is the calorimeter momentum re- 
sponse to jets. Note that the equation is missing the 
out-of-cone showering loss factor. In cone jets, this fac- 
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Figure 28. The average subjet multiplicity, as a func- 
tion of y cu t, in a sample of jets reconstructed in MC 
data at the particle level, with no calorimeter or lu- 
minosity simulation. The solid curve shows the re- 
sults with AlZp = 0, and the dashed curve shows the 
results with A1Z P — 0.2. The preclustering radius 
AlZp = 0.2 reduces the average subjet multiplicity for 
Vent < 10" 
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Figure 27. Same as in Fig. g6|, except the jets were 
reconstructed in MC data at the particle level, with 
no calorimeter or luminosity simulation. The same 
preclustering radius A1Z P = 0.2 reduces the mean of 
the leading jet px by 0.25 GeV. The bottom panel 
shows the ratio of the histograms in the top panel. 



tor corrects for the fraction of the energy of the final- 
state hadrons which is lost outside the cone boundaries 
due to calorimeter showering. This is an instrumenta- 
tion effect completely unrelated to parton showering 
losses outside the cone. There is no correction for the 
latter. Note that the important issue here is not so 
much that po be small or that Rj et be near unity, but 
rather that these parameters can be determined with 
precision. This is the question to be addressed when 
comparing jet algorithms. 

The D0 uranium-liquid argon sampling calorime- 
ters [ |30t] are shown in Fig. 29 30l They constitute the 



primary system used to identify e, 7, jets and missing 
transverse energy (jjCj,). Jpx is defined as the nega- 
tive of the vector sum of the calorimeter cell trans- 
verse energies (£t's). The Central (CC) and End (EC) 
Calorimeters contain approximately seven and nine in- 
teraction lengths of material respectively, ensuring con- 
tainment of nearly all particles except high pt muons 
and neutrinos. The intercryostat region (IC), between 
the CC and the EC calorimeters, is covered by a scintil- 
lator based intercryostat detector (ICD) and massless 
gaps (MG) [ |o). The segmentation is Ac/> x A?/ = 0.1 
x 0.1 (or smaller). 
The fractional energy resolution, oe/E, character- 



izes the suitability of the D0 calorimeter system for 
in-situ momentum calibration techniques. It is param- 
eterized with a yj S 2 /E + C 2 functional form. For elec- 
trons, the sampling term, S, is 14.8 (15.7)% in the CC 
(EC), and the constant term, C, is 0.3% in both the CC 
and EC. For pions, the sampling term is 47.0 (44.6)%, 
and the constant term is 4.5 (3.9)% in the CC (EC). 
The energy response is linear to within 0.5% for elec- 
trons above 10 GeV and for pions above 20 GeV. The 
D0 calorimeters are nearly compensating, with an ^ 
ratio less than 1.05 above 30 GeV. Due to the hermitic- 
ity and linearity of the D0 calorimeters their response 
function is well described by a Gaussian distribution. 
These properties indicate that the D0 calorimeter sys- 
tem is well suited for jet and $ T measurements and are 
the basis of the in-situ calibration method described 
here. 

4.4.1. Offset Correction 

The total offset correction is measured in transverse 
momentum and expressed as pt,o = O ue + O z b- The 
first term is the contribution of the underlying event 
(energy associated with the spectator partons in a high 
Pt event). The second term accounts for uranium 
noise, pile-up and energy from additional pp interac- 
tions in the same crossing. Pile-up is the residual 
energy from previous pp crossings as a result of the 
long shaping times associated with the preamplifica- 
tion stage in calorimeter readout cells. 

To simulate the contribution of O z b to jets, D0 Run I 
collider data taken in a random pp crossing (no trig- 
ger requirements) was overlayed on high pr jet her- 
wig [ [[l| Monte Carlo events. Jets were matched in 



25 



Inner Hadronic 
(Fine & Coarse) 




^ Electromagnetic 
Fine Hadronic 



Coarse Hadronic 



Electromagnetic 



Figure 29. The D0 liquid argon calorimeter 
is divided physically into three cryostats, defin- 
ing the central calorimeter and two end calorime- 
ters. Plates of absorber material are immersed in 
the liquid argon contained by the cryostats. Each 
cryostat is divided into an electromagnetic, fine 
hadronic, and coarse hadronic section. 



Figure 30. One quadrant of the D0 calorimeter 
and drift chamber, projected in the x—z plane. Ra- 
dial lines illustrate the detector pseudorapidity and 
the pseudoprojective geometry of the calorimeter 
towers. Each tower is of size A77 x = 0.1 x 0.1. 



this sample to jets in the sample with no overlay. The 
contribution of uranium noise, pile-up, and multiple 
interactions was determined by taking the difference 
in px between matched pairs. O ue was extracted in 
the same way from the overlap of low luminosity min- 
imum bias data (a crossing with an inelastic collision) 
on Monte Carlo events. O ue and O z b for jets with 
Pt = 30 — 50 GeV are shown in Figs. ||l] and The 
offset is derived in the central calorimeter and extrap- 
olated to higher 7/ regions. 

4.4.2. Response: The Missing Et Projection 
Fraction Method 

D0 makes a direct measurement of the jet momen- 
tum response using conservation of px in Run I photon- 
jet (7-jet) collider events [ ^7). Previously, the photon 
energy /momentum scale was determined from the D0 
Z — > e+e~, J/tp and tt° data samples, using the masses 
of these known resonances. In the case of a 7-jet two 
body process, the jet momentum response can be mea- 
sured as: 



Rjet — 1 



'rp • TlT~y 
PT~f 



(56) 



where pt 1 and n are the transverse momentum and di- 
rection of the photon. To avoid resolution and trigger 
biases, Rj et is binned in terms of E' = p™^ as -cosh(^j et ) 
and then mapped onto p" l J: as ■ E' depends only on pho- 
ton variables and jet pseudorapidity, which are quan- 
tities measured with very good precision. Rj et and E' 
depend only on the jet position, which has little de- 
pendence on the type of jet algorithm employed. 



Rjet as a function of pT^ t a!i (pKt) is shown in Fig. B3l 



ijet as a loucuoxi 01 p^ as 
The data is fit with the functional form Rj et {P) = 
a + b- ln(P) + c ■ ln(P) 2 . R jet for cone (R = 0.7) [ |7) 
and Kt (D = 1) jets are different by about 0.05. This 
difference does not have any physical meaning; it arises 
from different voltage-to-energy conversion factors at 
the cell level before reconstruction. 

4.4.3. Tests of the Method 

The accuracy of the Kt jet momentum scale correc- 
tion was verified using a herwig 7-jet sample and a 
fast version (showerlib) [ p| of the Run I detector 
simulation using GEANT [ |33[. A Monte Carlo jet mo- 
mentum scale was derived and the corrected jet mo- 
mentum compared directly to the momentum of the 
associated particle jet. Figure |34| shows the ratio of 
calorimeter and particle jet momentum before and af- 
ter the jet scale correction in the CC. The vertical bars 
are statistical errors. Systematic errors (not shown) 
are of the order of 0.01-0.02. After the jet correction 
is applied, the ratio versus particle jet pr is consistent 
with unity within the total uncertainty. 

4.4.4. Summary 

D0 improved the method introduced in Ref. [ 
for estimating the effects of underlying event, uranium 
noise, pile-up, and additional pp interactions. The off- 
set correction is larger for Kt jets with D = 1 than 
for cone jets with R = 0.7 by approximately 20-30%. 
The uncertainty (~0.1 GeV for underlying event, and 
~0.2 GeV for the second offset term in the CC), how- 
ever, is slightly smaller. A Kt (D = 1) algorithm 
reconstructs more energy from uranium noise, pile-up, 
underlying event, and multiple pp interactions than a 
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Figure 31. Physics underlying event offset O ue ver- 
sus rj. Above rj — 0.9, the result is an extrapolation. 



cone algorithm (R = 0.7). The accuracy of the associ- 
ated corrections are, however, on the same order. The 
missing Et projection fraction method is well suited 
to calibrate Kt jets [ The uncertainty in Rj et for 
Kt and cone jets is about the same: (0.5-1.6%) for jet 
p T = 50-450 GeV in the CC. 

Overall, it may be possible to determine the jet mo- 
mentum scale more accurately for Kt jets than the 
energy scale for cone jets, given the absence of a cone 
boundary in the former. The difference in precision 
could be large in the low pt and high pseudorapidity 
range, where the cone showering correction is larger 
and more inaccurately determined. (The showering 
correction uncertainty contributes 1-3% [ Q to the 
total error for R = 0.7 cone jets.) 




0.5 1 1.5 2 2.5 



V 

Figure 32. Offset due to uranium noise, pile-up 
and multiple interactions, O z b versus r\ for different 
luminosities in units of 10 30 cm _2 sec _1 . Above rj 
= 0.9, the result is an extrapolation. 

{D = 1) and cone {R = 0.7) jets at 60 GeV, and 0.10 ± 
0.01 for both at 80 GeV. Preliminary measurements of 
Kt jet momentum resolutions and cone jet energy res- 
olutions using Run I collider data support the previous 
statement. Note, however, that resolutions depend on 
the algorithm parameters R and D. Resolution stud- 
ies for different (smaller) R and D parameters should 
be performed, as well as for different type of samples, 
for example quark or gluon enriched samples. These 
studies will make more clear how energy /momentum 
resolutions compare for cone and Kt jets. 



4.5. Jet Momentum Resolutions of Kt Jets 

One of the largest sources of uncertainty in jet mea- 
surements (besides the jet momentum scale) is the ef- 
fect of a finite calorimeter jet momentum resolution. 
A priori, due to the absence of cone boundaries, Kt 
jets should be affected little by jet-to-jet fluctuations 
in the shower development. The jets will, of course, 
still be subjected to the effects of hadronization. 

We compared jet energy resolutions for cone jets 
(R = 0.7) and momentum resolutions for Kt jets 
(D = 1) derived from a D0 Monte Carlo simulation 
using the herwig event generator plus the GEANT sim- 
ulation of the D0 detector (Run I) . The test was per- 
formed for an inclusive jet sample with average pt = 
60 GeV and 80 GeV in \r]\ < 0.5. Within statistical 
errors, <7 Pt /pt for Kt jets and <te t /Et for cone jets 
are the same: 0.109 ± 0.009 and 0.105 ± 0.006 for K T 



4.6. Testing QCD with the K T Jet Algorithm 
4.6.1. Jet Structure 

The subjet multiplicity is a natural observable of a 
Kt jet [ EiBL p6j. Subjets are defined by re-running 
the Kt algorithm starting with a list of preclusters 
in a jet. Pairs of objects with the smallest dy are 
merged successively until all remaining dij are larger 
than y C utE T (jet), where < y cu t < 1 is a resolution 
parameter. The resolved objects are called subjets, 
and the number of subjets within the jet is the subjet 
multiplicity M. For y cut = 1, every jet consists of a 
single subjet (M =1). As y cut decreases, the subjet 
multiplicity increases until every precluster becomes 
resolved as a separate subjet. At this level of detail 
the specific preclustering algorithm used clearly influ- 
ences the result. A measurement of M for quark and 
gluon jets is a test of QCD, and may eventually be used 
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Figure 33. Rj e t versus Kt jet momentum. The 
solid lines are the fit and the dashed band the error 
of the fit. (The three lowest points have nearly fully 
correlated uncertainties and are excluded from the 
fit.) 
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Figure 34. Monte Carlo verification test. The ver- 
tical bars are statistical errors. Systematic errors 
(not shown) are of the order of 0.01-0.02. The 
corrected p^ as /p^lf ratio is consistent with unity 
within errors. 



in Run II as a discriminant variable to tag quark jets 
in the final state. Fig. |35| s hows a preliminary mea- 
surement of M by D0 [ |37|, using Run I data (Kt 
algorithm with D — 0.5 and y cu t = 0.001). The ratio 
R = \m 9 \z\ is 1.91 ± 0M(stat) ± 0.23(sys). It is well 
described by the herwig Monte Carlo, and illustrates 
the fact that gluons radiate more than quarks. 

4.6.2. Jet Production 

Jet cross section measurements have been exten- 
sively used by both Fermilab Tevatron collaborations 
during Run I to test perturbative (NLO) QCD pre- 
dictions, to test the available parton distribution func- 
tions at the x and Q 2 ranges covered by the Tevatron, 
and to search for quark compositeness [ |3j| |3J^, |4(], [II], 
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f«| |7j. The higher center-of- mass en- 
ergy and the larger data sample will allow the Tevatron 
experiments to extend the energy reach and precision 
of jet cross sections in Run II. The largest source of 
uncertainty in a jet cross section measurement is the 
jet energy (or momentum) scale. As an example, a 
1% uncertainty in the jet energy calibration translates 
into a 5-6% (10-15%) uncertainty in the \rj\ < 0.5 in- 
clusive jet cross section at 100 GeV (450 GeV). As a 
function of rj, the jet cross section falls more quickly 
with transverse energy, and the cross section error is 
even larger. 

The Kt jet algorithm may provide experimental ad- 
vantages for jet production measurements. At D0, the 
jet scale uncertainty for cone jets in the high Et range 
is dominated by the contributions from the response 
and out-of-cone showering corrections. In Run II, the 



availability of more high Et photon data and a more 
accurate determination of the position of the interac- 
tion vertex promise a reduction in the response uncer- 
tainty. Furthermore, the absence of out-of-cone show- 
ering losses in Kt jets will likely lead to improved jet 
cross section measurements in the forward rj regions. 
Most of the Run I cross section results by CDF and 
D0 use jet energy measurements restricted to central 
regions (\r]\ < 1). A couple of exceptions to the rule 
are the D0 measurements of the pseudorapidity de- 
pendence of the jet cross section [ and the test of 
BFKL dynamics in dijet cross sections at large pseu- 
dorapidity intervals [ Q . 

4.6.3. Event Shapes 

Event shape variables in e + e~ and ep interactions 
have attracted considerable interest over the last few 
years [ |4i| [5(], |5l| . Little attention has been paid to 
measurements or calculations of event shape variables 
at hadron colliders. An important example is thrust 
which is defined as: 



T 



E 2 \fi 



(57) 



where the sum is over all parton, or particle momenta. 

A LO jet rate calculation with two partons in the fi- 
nal state yields T = 1. A NLO calculation, with three 
partons in the final state would produce a deviation 
from T = 1 (LO in thrust). A NNLO prediction with 
four partons in the final state would then give a NLO 
prediction of thrust. At all orders, thrust would take 
values from 0.5 to unity. In other words, thrust mea- 
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Figure 35. Subjet multiplicity for quark and gluon 
jets at D0. 

sures the pencil-likeness of the event: T — > 1 for back- 
to-back events, and T < 1 as more radiation is present. 
The low scales introduced by soft and collinear emis- 
sion in events with T < 1 could be the reason for the 
observed discrepancy between LO and NLO calcula- 
tions and experimental e + e~ data [^J. Resummation 
of higher-order perturbative terms could lead to a bet- 
ter understanding of the problem. 

The simplest measurements of thrust we can perform 
are the thrust distributions in jet events, changing the 
definition of thrust to sum over all the jets in the event. 
In order to be able to resum logarithms of the jet res- 
olution scale, jets must be defined using an algorithm 
such as the Kt algorithm [ [52). The contribution of 
the underlying event, and multiple pp interactions in 
hadron colliders, introduce an experimental difficulty 
not present in lepton colliders. It is possible, however, 
to minimize these systematics by choosing carefully the 
variable to measure. 

We can also define transverse thrust, Tt, by replac- 
ing particle momenta by transverse momenta in Eq. |57]. 
Tt is Lorentz invariant for boosts along the beam axis, 
an advantage in the case of hadron colliders. 

Figs. p6|-|38| show the difference between Tt calcu- 
lated from particle-level jets (reconstructed from final- 
state hadrons) and Tt from calorimctcr-lcvcl jets (re- 
constructed from cells), herwig was used as the 
generator, and SHOWERLIB [ ^] (a fast version of 
GEANt) simulated the Run I detector. In all cases 
jets are reconstructed with the Kt jet algorithm (D = 
1). Fig. ^ shows a Tt distribution for events with 
Ht = 90-150 GeV, where Ht is the scalar sum pt of 
all jets above 8 GeV. Ht was chosen instead of Q 2 as 
an estimator of the hard scattering energy scale of the 
event. All jets with pt > 8 GeV contribute to Tt- 



The full circles are the particle-level or "true" distri- 
bution. The triangles are the distribution as seen in 
the calorimeter in an ideal environment with no off- 
set (underlying event, multiple pp interactions, pile- 
up, or noise). The open circles are a calorimeter-level 
distribution which includes a random collider crossing 
event at a luminosity of 5 x 10 30 cm. _1 sec _1 . While 
the effect of calorimeter momentum response, resolu- 
tion, and showering is minimal, the offset distorts the 
distribution to a large extent. 
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Figure 36. All jets with pt > 8 GeV contribute 
to Tt- The full circles are the particle- level or 
"true" distribution. The triangles are the distri- 
bution as seen in the calorimeter in an ideal envi- 
ronment with no offset (underlying event, multiple 
pp interactions, pile-up, or noise). The open circles 
are a calorimeter-level distribution which includes 
a random crossing collider event at a luminosity of 
5 x 10 30 cm~ 2 sec _1 . 

In Fig. |37], the thrust definition was modified to al- 
low only the three leading jets (above 8 GeV) to con- 
tribute to the thrust (Ttz) and to Ht (now Hts)- 
The difference between the true and the fully-simulated 
calorimeter distribution is now much smaller. Finally, 
in Fig. [38], only the two leading jets contribute to the 
thrust (T T2 ) for events with H T 3 = 90-150 GeV. Now 
the calorimeter distribution is even closer to the true 
distribution. Although Tt3 and Tti are not calculated 
from all final-state particles (to reduce contamination) , 
they implicitly include the information about the whole 
radiation pattern through the pt and rj — <p position of 
the first few leading jets. 

Event shape variables, like a modified version of 
thrust, can be studied with precision at the Tevatron. 
The use of the Kt algorithm, infrared safe at all orders 
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Figure 37. Same as Fig. |36| but only the three lead- 
ing jets contribute to Tt, now Tt3- Ht3 is the 
scalar sum pt of the three leading jets in the event. 
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Figure 38. Same as Fig. |36| but only the two leading 
jets contribute to Tt, now Tti- Ht3 is the scalar 
sum pt of the three leading jets in the event. 



in perturbation theory provides a test of the newly 
available hadronic three jet production calculations at 
NLO [ |, |§. In the QCD calculation of the thrust 
variables defined in this section, there are no large log- 
arithms in the T — > 1 limit. Then, it is neither possible 
nor necessary to resum them. However, if we redefine 
thrust in terms of subjets or tracks, the measurement is 
more interesting and resummation becomes an issue [ 
[34). The availability of the contributions of higher- 
order terms through a resummation calculation would 
be desirable, in that case, to improve the understand- 
ing of the range T < 1. In Run II, both the CDF 
and the D0 detectors will have upgraded tracking sys- 
tems. This will allow both experiments to implement 
improved techniques for the identification of hadrons 
using both the calorimeter and the tracking detectors. 

The Ht dependence of (1 — T), in the range where 
resummation and hadronization effects are small, could 
also provide a measurement of a s . 

5. Conclusions 

Jet algorithms present a challenge to experimental- 
ists and theorists alike. Although everyone "knows a 
jet when they see it," precise definitions are elusive and 
detailed. The jet working group has attempted to pro- 
vide guidelines and recommendations for jet algorithm 
development. The end product of the year- long effort 
has been standardized jet cone and Kt algorithms, and 
the recommendation to use 4- vector, E-scheme kine- 
matic variables. A legacy algorithm or ILCA has been 
suggested which will bridge the gap between past re- 
sults and improved theoretical calculations. This doc- 
ument has addressed concerns about the use and cali- 



bration of Kt jets. 

We strongly recommend that both CDF and D0 
adopt standard algorithms for Run II. Since contin- 
ued development is probably inevitable, we encourage 
continued dialogue. The usefulness of standardized al- 
gorithms, which can replicate past results and meet 
experimental and theoretical requirements, makes con- 
tinued coordination well worth the effort. 
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